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Unified field Theory 



In the beginning there was Aristotle 

And objects at rest tended to remain at rest 

And objects in motion tended to come to rest 

And soon everything was at rest 

And God saw that it was boring. 

Then God created Newton 

And objects at rest tended to remain at rest 

But objects in motion tended to remain in motion 

And energy was conserved and momentum was conserved 

and matter was conserved 
And God saw that it was conservative. 

Then God created Einstein 
And everything was relative 
And fast things became short 
And straight things became curved 
And the universe was filled with inertial frames 
And God saw that it was relatively general 
but some of it was especially relative. 

Then God created Bohr 
And there was the Vrinciple 
And the principle was Quantum 
And all things were quantized 
But some things were still TZelative 
And God saw that it was confusing. 

Then God was going to create M. 

And M. would have unified 

And M. would have fielded a theory 

And all would have been one 

But it was the seventh day 

And God rested 

And objects at rest tend to remain at rest. 

Adapted from a poem by Tim Joseph 
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PREFACE 

The idea of complex spacetime as a unification of spacetime and classi- 
cal phase space, suitable as a possible geometric basis for the synthesis 
of Relativity and quantum theory, first occured to me in 1966 while 
I was a physics graduate student at the University of Wisconsin. In 
1971, during a seminar I gave at Carleton University in Canada, it 
was pointed out to me that the formalism I was developing was re- 
lated to the coherent-state representation, which was then unknown 
to me. This turned out to be a fortunate circumstance, since many 
of the subsequent developments have been inspired by ideas related 
to coherent states. My main interest at that time was to formulate 
relativistic coherent states. 

In 1974, I was struck by the appearance of tube domains in ax- 
iomatic quantum field theory. These domains result from the analytic 
continuation of certain functions (vacuum expectaion values) associ- 
ated with the theory to complex spacetime, and powerful methods 
from the theory of several complex variables are then used to prove 
important properties of these functions in real spacetime. However, 
the complexified spacetime itself is usually not regarded as having 
any physical significance. What intrigued me was the possibility that 
these tube domains may, in fact, have a direct physical interpretation 
as (extended) classical phase spaces. If so, this would give the idea of 
complex spacetime a firm physical foundation, since in quantum field 
theory the complexification is based on solid physical principles. It 
could also show the way to the construction of relativistic coherent 
states. These ideas were successfully worked out in 1975-76, culmi- 
nating in a mathematics thesis in 1977 at the University of Toronto 
entitled "Phase-Space Approach to Relativistic Quantum Mechan- 
ics." 

Up to that point, the theory could only describe free particles. 
The next goal was to see how interactions could be added. Some 
progress in this direction was made in 1979-80, when a natural way 
was found to extend gauge theory to complex spacetime. Further 
progress came during my sabbatical in 1985-86, when a method was 
developed for extending quantized fields themselves (rather than their 
vacuum expectation values) to complex spacetime. These ideas have 
so far produced no "hard" results, but I believe that they are on the 
right path. 

Although much work remains to be done, it seems to me that 
enough structure is now in place to justify writing a book. I hope 
that this volume will be of interest to researchers in theoretical and 
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mathematical physics, mathematicians interested in the structure of 
fundamental physical theories and assorted graduate students search- 
ing for new directions. Although the topics are fairly advanced, much 
effort has gone into making the book self-contained and the subject 
matter accessible to someone with an understanding of the rudiments 
of quantum mechanics and functional analysis. 

A novel feature of this book, from the point of view of math- 
ematical physics, is the special attention given to "signal analysis" 
concepts, especially time-frequency localization and the new idea of 
wavelets. It turns out that relativistic coherent states are similar to 
wavelets, since they undergo a Lorentz contraction in the direction 
of motion. I have learned that engineers struggle with many of the 
same problems as physicists, and that the interplay between ideas 
from quantum mechanics and signal analysis can be very helpful to 
both camps. For that reason, this book may also be of interest to 
engineers and engineering students. 

The contents of the book are as follows. In chapter 1 the simplest 
examples of coherent states and time-frequency localization are intro- 
duced, including the original "canonical" coherent states, windowed 
Fourier transforms and wavelet transforms. A generalized notion of 
frames is defined which includes the usual (discrete) one as well as 
continuous resolutions of unity, and the related concept of reproduc- 
ing kernels is discussed. 

In chapter 2 a new, algebraic approach to orthonormal bases of 
wavelets is formulated. An operational calculus is developed which 
simplifies the formalism considerably and provides insights into its 
symmetries. This is used to find a complex structure which explains 
the symmetry between the low- and the high-frequency filters in 
wavelet theory. In the usual formulation, this symmetry is clearly 
evident but appears to be accidental. Using this structure, complex 
wavelet decompositions are considered which are analogous to ana- 
lytic coherent-state representations. 

In chapter 3 the concept of generalized coherent states based on 
Lie groups and their homogeneous spaces is reviewed. Considerable 
attention is given to holomorphic (analytic) coherent-state represen- 
tations, which result from the possibility of Lie group complexifica- 
tion. The rotation group provides a simple yet non-trivial proving 
ground for these ideas, and the resulting construction is known as the 
"spin coherent states." It is then shown that the group associated 
with the Harmonic oscillator is a weak contraction limit (as the spin 
s — > oo) of the rotation group and, correspondingly, the canonical co- 
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herent states are limits of the spin coherent states. This explains why 
the canonical coherent states transform naturally under the dynamics 
generated by the harmonic oscillator. 

In chapter 4, the interactions between phase space, quantum me- 
chanics and Relativity are studied. The main ideas of the phase- 
space approach to relativistic quantum mechanics are developed for 
free particles, based on the relativistic coherent-state representations 
developed in my thesis. It is shown that such representations admit 
a covariant probabilistic interpretation, a feature absent in the usual 
spacetime theories. In the non-relativistic limit, the representations 
are seen to "contract" smoothly to representations of the Galilean 
group which are closely related to the canonical coherent-state rep- 
resentation. The Gaussian weight functions in the latter are seen to 
emerge from the geometry of the mass hyperboloid. 

In chapter 5, the formalism is extended to quantized fields. The 
basic tool for this is the Analytic-Signal transform, which can be ap- 
plied to an arbitrary function on IR n to give a function on (D n which, 
although not in general analytic, is "analyticity-friendly" in a cer- 
tain sense. It is shown that even the most general fields satisfying 
the Wightman axioms generate a complexification of spacetime which 
may be interpreted as an extended classical phase space for certain 
special states associated with the theory. Coherent-state representa- 
tions are developed for free Klein-Gordon and Dirac fields, extending 
the results of chapter 4. The analytic Wightman two-point functions 
play the role of reproducing kernels. Complex-spacetime densities of 
observables such as the energy, momentum, angular momentum and 
charge current are seen to be regularizations of their counterparts 
in real spacetime. In particular, Dirac particles do not undergo their 
usual Zitterbewegung. The extension to complex spacetime separates, 
or polarizes, the positive- and negative-frequency parts of free fields, 
so that Wick ordering becomes unnecessary. A functional-integral 
representation is developed for quantized fields which combines the 
coherent-state representations for particles (based on a finite number 
of degrees of freedom) with that for fields (based on an infinite number 
of degrees of freedom). 

In chapter 6 we give a brief account of some ongoing work, begin- 
ning with a review of the idea of holomorphic gauge theory. Whereas 
in real spacetime it is not possible to derive gauge potentials and 
gauge fields from a (fiber) metric, we show how this can be done in 
complex spacetime. Consequently, the analogy between General Rel- 
ativity and gauge theory becomes much closer in complex spacetime 
than it is in real spacetime. In the "holomorphic" gauge class, the 
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relation between the (non-abelian) Yang-Mills field and its potential 
becomes linear due to the cancellation of the non-linear part which 
follows from an integrability condition. Finally, we come full circle by 
generalizing the Analytic-Signal transform and pointing out that this 
generalization is a higher-dimensional version of the wavelet trans- 
form which is, moreover, closely related to various classical transforms 
such as the Hilbert, Fourier-Laplace and Radon transforms. 

I am deeply grateful to G. Emch for his continued help and en- 
couragement over the past ten years, and to John Klauder and Ray 
Streater for having read the manuscript carefully and made many 
invaluable comments, suggestions and corrections. (Any remaining 
errors are, of course, entirely my responsibility.) I also thank D. Buch- 
holtz, F. Doria, D. Finch, S. Helgason, I. Kupka, Y. Makovoz, J. E. 
Marsden, M. O'Carroll, L. Rosen, M. B. Ruskai and R. Schor for mis- 
cellaneous important assistance and moral support at various times. 
Finally, I am indebted to L. Nachbin, who first invited me to write 
this volume in 1981 (when I was not prepared to do so) and again in 
1985 (when I was), and who arranged for a tremendously interesting 
visit to Brazil in 1982. Quero tambem agradecer a todos os meus 
colegas Brasileiros! 



Suggestions to the Reader 

The reader primarily interested in the phase-space approach to 
relativistic quantum theory may on first reading skip chapters 1-3 
and read only chapters 4-6, or even just chapter 4 and either chap- 
ters 5 or 6, depending on interest. These chapters form a reasonably 
self-contained part of the book. Terms defined in the previous chap- 
ters, such as "frame," can be either ignored or looked up using the 
extensive index. The index also serves partially as a glossary of fre- 
quently used symbols. The reader primarily interested in signal analy- 
sis, time-frequency localization and wavelets, on the other hand, may 
read chapters 1 and 2 and skip directly to sections 5.2 and 6.2. The 
mathematical reader unfamiliar with the ideas of quantum mechanics 
is urged to begin by reading section 1.1, where some basic notions are 
developed, including the Dirac notation used throughout the book. 
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1.1. Preliminaries 

In this section we establish some notation and conventions which will 
be followed in the rest of the book. We also give a little background on 
the main concepts and formalism of non-relativistic and relativistic 
quantum mechanics, which should make this book accessible to non- 
specialists. 

1. Spacetime and its Dual 
In this book we deal almost exclusively with flat spacetime, though 
we usually let space be IR S instead of IR 3 , so that spacetime becomes 
X = IR S+1 . The reason for this extension is, first of all, that it involves 
little cost since most of the ideas to be explored here readily generalize 
to IR S+1 , and furthermore, that it may be useful later. Many models 
in constructive quantum field theory are based on two- or three- 
dimensional spacetime, and many currently popular attempts to unify 
physics, such as string theories and Kaluza-Klein theories, involve 
spacetimes of higher dimensionality than four or (on the string world- 
sheet) two-dimensional spacetimes. An event x E X has coordinates 



where x° = t is the time coordinate and x J are the space coordinates. 
Greek indices run from to s, while latin indices run from 1 to s. If 
we think of translation vector, then X is the vector space of 

all translations in spacetime. Its dual X* is the set of all linear maps 
k: X — > IR. By linearity, the action of k on x (which we denote by kx 
instead of k(x)) can be written as 




o 




s 




(2) 



where we adopt the Einstein summation convention of automatically 
summing over repeated indices. Usually there is no relation between 
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x and k other than the pairing (x, k) \— > /ex. But suppose we are given 
a scalar product on X, 



where {g^v) is a non-degenerate matrix. Then each x in X defines 
a linear map x*: X — > IR by = x • x', thus giving a map 

*:X^X*,with 



= x„ =g llv x». (4) 

Since <7 MI/ is non-degenerate, it also defines a scalar product on X* , 
whose metric tensor is denoted by g^ v . The map x — > x* establishes 
an isomorphism between the two spaces, which we use to identify 
them. If x denotes a set of inertial coordinates in free spacetime, then 
the scalar product is given by 

g^v = diag(c 2 , -1, -1, • • • , -1) 

where c is the speed of light. X, together with this scalar product, is 
called Minkowskian or Lorentzian spacetime. 

It is often convenient to work in a single space rather than the 
dual pair X and X*. Boldface letters will denote the spatial parts of 
vectors in X* . Thus x = (t, — x), k = (ko, k) and 

x ■ x' = c 2 tt' — x ■ x' and kx = k ■ x* = k t — k ■ x, (5) 
where x • x' and k • x denote the usual Euclidean inner products in 

2. Fourier Transforms 
The Fourier transform of a function /: X — > (D (which, to avoid an- 
alytical subtleties for the present, may be assumed to be a Schwartz 
test function; see Yosida [1971]) is a function /: X* — > (C given by 

/>)= / dxe^ ikx f(x) (6) 
Jx 

where dx = dtd s x is Lebesgue measure on X . f can be reconstructed 
from / by the inverse Fourier transform, denoted by " and given by 

f(x)= [ dke- 2 ™ k *f(k) = (fy(x), (7) 
Jx* 
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where dk = dk d s k denotes Lebesgue measure on X* pz X. Note that 
the presence of the 2n factor in the exponent avoids the usual need 
for factors of (2tt)~( s+1 ^ 2 or (27r) _s_1 in front of the integrals. Phys- 
ically, k represents a wave vector: ko = v is a frequency in cycles per 
unit time, and kj is a wave number in cycles per unit length. Then the 
interpretation of the linear map k: X — > IR is that 2nkx is the total 
radian phase gained by the plane wave g(x') = exp(—2irikx') through 
the spacetime translation x, i.e. 2nk "measures" the radian phase 
shift. Now in pre-quantum relativity, it was realized that the energy 
E combines with the momentum p to form a vector p = = (E, p) 
in X* . Perhaps the single most fundamental difference between clas- 
sical mechanics and quantum mechanics is that in the former, matter 
is conceived to be made of "dead sets" moving in space while in the 
latter, its microscopic structure is that of waves descibed by complex- 
valued wave functions which, roughly speaking, represent its distribu- 
tion in space in probabilistic terms. One important consequence of 
this difference is that while in classical mechanics one is free to spec- 
ify position and momentum independently, in quantum mechanics a 
complete knowledge of the distribution in space, i.e. the wave func- 
tion, determines the distribution in momentum space via the Fourier 
transform. The classical energy is re-interpreted as the frequency of 
the associated wave by Planck's Ansatz, 

p = E = 2-nhv (8) 

where % is Planck's constant, and the classical momentum is re- 
interpreted as the wave-number vector of the associated wave by De 
Broglie's relation, 

p = 2nhk. (8') 

These two relations are unified in relativistic terms as p^ = 2iiTik^. 
Since a general wave function is a superposition of plane waves, each 
with its own frequency and wave number, the relation of energy and 
momentum to the the spacetime structure is very different in quantum 
mechanics from what is was in classical mechanics: They become 
operators on the space of wave functions: 

(PJ)(x) = J dkp,e- 2 ^ kx f{k) = ih-^fiz), (9) 



or, in terms of x*, 
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P = ih— and P k = -ih — . (9') 

This is, of course, the source of the uncertainty principle. In terms 
of energy-momentum, we obtain the "quantum-mechanical" Fourier 
transform and its inverse, 



f(p) = [ dxe^/ h f(x) 
Jx 

f(x) = (2nh)- s ~ 1 [ dpe-to/tfo). 
Jx* 



(10) 



If f(x) satisfies a differential equation, such as the Schrodinger 
equation or the Klein-Gordon equation, then f(p) is supported on 
an s-dimensional submanifold P of X* (a paraboloid or two-sheeted 
hyperboloid, respectively) which can be parametrized by p G IR S . We 
will write the solution as 

f{x) = [2nh)~ s J d»(p) e-^/ h f(p), (11) 

where f(p) is, by a mild abuse of notation, the "restriction" of / to 

P (actually, \f(p)\ 2 is a density on P) and dfj,(p) = p(p)d s p is an 
appropriate invariant measure on P. For the Schrodinger equation 
p(p) = 1, whereas for the Klein-Gordon equation, p(p) = |po| _1 - 
Setting t = then shows that f(p) is related to the initial wave 
function by 

/(x,0)=(p( P )/)"(x), (12) 

where now " " " denotes the the s-dimensional inverse Fourier trans- 
form of the function JoqP~ IR s . 

We will usually work with "natural units," i.e. physical units 
so chosen that % = c = 1. However, when considering the non- 
relativistic limit (c — > oo) or the classical limit (ft — > 0), c or % will be 
re-inserted into the equations. 

3. Hilbert Space 

Inner products in Hilbert space will be linear in the second factor and 
antilinear in the first factor. Furthermore, we will make some discrete 
use of Dirac's very elegant and concise bra-ket notation, favored by 
physicists and often detested or misunderstood by mathematicians. 
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As this book is aimed at a mixed audience, I will now take a few para- 
graphs to review this notation and, hopefully, convince mathemati- 
cians of its correctness and value. When applied to coherent-state 
representations, as opposed to representations in which the position- 
or momentum operators are diagonal, it is perfectly rigorous. (The 
bra-ket notation is problematic when dealing with distributions, such 
as the generalized eigenvectors of position or momentum, since it tries 
to take the "inner products" of such distributions.) 

Let 7i be an arbitrary complex Hilbert space with inner prod- 
uct (•,•). Each element / G T~t defines a bounded linear functional 
/*: H^Vby 

f*(g) = (f,g)- (13) 

The Riesz representation theorem guarantees that the converse is also 
true: Each bounded linear functional L : 7i — > (E has the form L = /* 
for a unique / G TL. Define the bra (/| corresponding to / by 

(/|=/*:W-(C. (14) 

Similarly, there is a one-to-one correspondence between vectors g G H 
and linear maps 

\g):<E^H (15) 

defined by 



\g)(X) = Xg, Ag(E, (16) 

which will be called kets. Thus elements of 7i will be denoted alter- 
natively by g or by \g). We may now consider the composite map 
bra-ket 



(/|<7>:<C^<C, (17) 

given by 

(f\g)(\) = r(\g) = \f(g) 
= Mf,g). 

Therefore the "bra-ket" map is simply the multiplication by the in- 
ner product (/, g) (whence it derives its name). Henceforth we will 
identify these two and write (/| g) for both the map and the inner 
product. The reverse composition 
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\g)(f\:H^H (19) 
may be viewed as acting on kets to produce kets: 

\g)(f\(\h)) = \g)((f\h)). (20) 

To illustrate the utility of this notation, as well as some of its 
pitfalls, suppose that we have an orthonormal basis {g n } in H- Then 
the usual expansion of an arbitrary vector / in Ti takes the form 

f = Y.9n{9n\f) = Y,\9n){9n\f), (21) 

n n 

from which we have the "resolution of unity" 

J2\9n)(9n\=I (22) 

n 

where I is the identity on Ti and the sum converges in the strong 
operator topology. If {h n } is a second orthonormal basis, the relation 
between the expansion coefficients in the two bases is 



(h n \f) = (h n \If)=J2(hn\9m)(9m\f). (23) 

m 

In physics, vectors such as g n are often written as |n), which 
can be a source of great confusion for mathematicians. Furthermore, 
functions in L 2 (IR S ), say, are often written as f(x) = (x\f), with 
(x\x f ) = S(x — x'), as though the | x )'s formed an orthonormal basis. 
This notation is very tempting; for example, the Fourier transform is 
written as a "change of basis," 

(k\f)= J dx{k\x)(x\f) (24) 

with the "transformation matrix" (k\x) = exp(2nikx). One of the 
advantages of this notation is that it permits one to think of the 
Hilbert space as "abstract," with (g n \ f)-,(h n \ f),(x \ f) and ( k | / ) 
merely different "representations" (or "realizations" ) of the same vec- 
tor /. However, even with the help of distribution theory, this use 
of Dirac notation is unsound, since it attempts to extend the Riesz 
representation theorem to distributions by allowing inner products of 
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them. (The "vector" (x\ is a distribution which evaluates test func- 
tions at the point x; as such, | x' ) does not exist within modern-day 
distribution theory.) We will generally abstain from this use of the 
bra-ket notation. 

Finally, it should be noted that the term "representation" is used 
in two distinct ways: (a) In the above sense, where abstract Hilbert- 
space vectors are represented by functions in various function spaces, 
and (b) in connection with groups, where the action of a group on a 
Hilbert space is represented by operators. 

This notation will be especially useful when discussing frames, of 
which coherent-state representations are examples. 



1.2. Canonical Coherent States 

We begin by recalling the original coherent-state representations 
(Bargmann [1961], Klauder [1960, 1963a, b], Segal [1963a]). Con- 
sider a spinless non-relativistic particle in IR S (or s/3 such particles 
in IR 3 ), whose algebra of observables is generated by the position op- 
erators Xk and momentum operators P k , k = 1,2, . . . s. These satisfy 
the "canonical commutation relations" 

[x k ,xt\ = o, [p k ,n] = o, [x k ,n] = i6 k ii, (i) 

where / is the identity operator. The operators — iX k , —iPk and —il 
together form a real Lie algebra known as the Heisenberg algebra, 
which is irreducibly represented on L 2 (IR S ) by 

X k f(x) = x k f(x), P k f(x) = -i — f(x), (2) 

ox k 

the Schrddinger representation. 

As a consequence of the above commutation relations between X k 
and P k , the position and momentum of the particle obey the Heisen- 
berg uncertainty relations, which can be derived simply as follows. 
The expected value, upon measurement, of an observable represented 
by an operator F in the state represented by a wave function f(x) 
with ll/H = 1 (where || ■ || denotes the norm in L 2 (IR S )) is given by 



(F) = (f\Ff). 



(3) 
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In particular, the expected position- and momentum coordinates of 
the particle are (X k ) and (P k ). The uncertainties Ax k and Ap k in 
position and momentum are given by the variances 



tf Xh = ({x k -{x k )Y) = {xl)-{x k , 

A Pk = ((p k -(p k )r) = (Pi)-( Pk ^ 



Choose an arbitrary constant b with units of area (square length) and 
consider the operators 

d 

A k =X k + ibP k =x k + b—. (5) 

ox k 

Notice that although A k is non-Hermitian, it is real in the Schrodin- 
ger representation. Let 

z k = (f\A k f) = (X k )+ib(P k ), (6) 

where z k denotes the complex-conjugate of z k . Then for SA k = A k — 
z k I we have ( 5A k ) = and 

0<\\5A k ff = A Xk +b 2 A Pk -b. (7) 

The right-hand side is a quadratic in 6, hence the inequality for all b 
demands that the discriminant be non-positive, giving the uncertainty 
relations 

A Xfe A Pfc > i (8) 

Equality is attained if and only if 5A k f = 0, which shows that the 
only minimum-uncertainty states are given by wave functions f(x) 
satisfying the eigenvalue equations 

A k f=(x k + b—)f = z k f (9) 

%k 

for some real number b (which may actually depend on k) and some 
z G (D s . But square-integrable solutions exist only for b > 0, and then 
there is a unique solution (up to normalization) \ z for each z G (E s . 
To simplify the notation, we now choose 6=1. Then A k and A* k 
satisfy the commutation relations 

[A k ,A L ]=0, [A kt At] = 2S kl I, (10) 
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and Xz is given by 

Xz(x') = Nex.p[-z 2 /4 + z ■ x - x' 2 /2], (11) 

where the normalization constant is chosen as N = 7r _s / 4 , so that 
IIXzll = 1 for z = 0. Here z 2 is the (complex) inner product of z with 
itself. Clearly Xz is in L 2 (IR S ), and if z = x — ip, then 

(X k ) = x k and (P k ) = p k (12) 

in the state given by Xz- The vectors Xz are known as the canon- 
ical coherent states. They occur naturally in connection with the 
harmonic oscillator problem, whose Hamiltonian can be cast in the 
form 

H = -L (P 2 + m 2 u 2 X 2 ) = \mu 2 A* ■ A + ^ (13) 

with 

A fc = X fc + iP k /muj (14) 

(thus 6 = 1/mu). They have the remarkable property that if the ini- 
tial state is then the state at time t is Xz(t) where z(t) is the orbit 
in phase space of the corresponding classical harmonic oscillator with 
initial data given by z. These states were discovered by Schrodinger 
himself [1926], at the dawn of modern quantum mechanics. They were 
further investigated by Fock [1928] in connection with quantum field 
theory and by von Neumann [1931] in connection with the quantum 
measurement problem. Although they span the Hilbert space, they 
do not form a basis because they possess a high degree of linear de- 
pendence, and it is not easy to find complete, linearly independent 
subsets. For this reason, perhaps, no one seemed to know quite what 
to do with them until the early 1960's, when it was discovered that 
what really mattered was not that they form a basis but what we 
shall call a generalized frame. This allows them to be used in gen- 
erating a representation of the Hilbert space by a space of analytic 
functions, as explained below. The frame property of the coherent 
states (which will be studied and generalized in the following sections 
and in chapter 3) was discovered independently at about the same 
time by Klauder, Bargmann and Segal. Glauber [1963a,b] used these 
vectors with great effectiveness to extend the concept of optical co- 
herence to the domain of quantum electrodynamics, which was made 
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necessary by the discovery of the laser. He dubbed them "coherent 
states," and the name stuck to the point of being generic. (See also 
Klauder and Sudarshan [1968].) Systems of vectors now called "co- 
herent" may have nothing to do with optical coherence, but there is 
at least one unifying characteristic, namely their frame property (next 
section) . 

The coherent-state representation is now defined as follows: Let 
T be the space of all functions 



f(z) = ( Xz \f) = J d s x' Xz {x')f{x') 

= N J d s x' exp[-z 2 /2 + z ■ x' - x' 2 /2]f(x') 



(15) 



where / runs through L 2 (IR S ). Because the exponential decays ra- 
pidly in x' , f is entire in the variable z G (E s . Define an inner product 
on T by 



(f\~g)T= / du.(z)f(z)~g(z), (16) 
where z = x — ip and 

dp(z) = (2n)- s exp(-z ■ z/2) d s x d s p. (17) 

Then we have the following theorem relating the inner products in 
L 2 (M S ) and T. 

Theorem 1.1. Let f,g G L 2 (IR S ) and let f,g be the corresponding 
entire functions in T . Then 

(f\g)r = {f\g)L*- (is) 

Proof. To begin with, assume that / is in the Schwartz space iS(IR s ) 
of rapidly decreasing smooth test functions. For z = x — ip, we have 

Xz(x') = Nexp[-z 2 /2 + x 2 /2 - (x' - x) 2 /2 + ip ■ x% 

hence 



f(x-ip) = Nexp[(x 2 +p 2 )/4 + ipx/2](exp[-(x' -x) 2 /2]f)\p) (19) 
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where " denotes the Fourier transform with respect to x' . Thus by 
Plancherel's theorem (Yosida [1971]), 



(2tt)- s f rf s pexp(V/2)|/>-^p)| 2 

J , (20) 

= N 2 exp(x 2 /2) / d s x'exp[-(x' - x) 2 ] \f(x')\ 2 . 

Therefore 

/ dfi(z) \f(z) | 2 

= N 2 J d s x J d s x' exp[-(x' - xf] \f(x') | 2 (21) 
= J d s x'\f{x')\ 2 
after exchanging the order of integration. This proves that 

\\i\\%=\\f\\h (22) 

for / G 5(IR S ), hence by continuity also for arbitrary / G L 2 (IR S ). By 
polarization the result can now be extended from the norms to the 
inner products. I 

The relation / <-> / can be summarized neatly and economically 
in terms of Dirac's bra-ket notation. Since 



/(*) = (Xz\f) = (/|X Z >, (23) 
theorem 1 can be restated as 

/ d»(z)(f\ Xz )( Xz \g) = (f\g). (24) 
Dropping the bra (/| and ket \g), we have the operator identity 

/ dfjL(z)\xz)iXz\ = I, (25) 
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where I is the identity operator on L 2 (IR S ) and the integral converges 
at least in the sense of the weak operator topology* 1.6. ctS ct quadratic 
form. In Klauder's terminology, this is a continuous resolution of 
unity. A general operator B on L 2 (IR S ) can now be expressed as an 

integral operator B on T as follows: 



Particularly simple representations are obtained for the basic po- 
sition- and momentum operators. We get 



(Bf)(z) = (Bf)~(z) 
= (Xz\Bf) 
= (Xz\BIf) 




(26) 



X kXz (x') 



x' k Xz(x') 




(27) 



PkXz 



i(A k - X k )xz 




thus 




(28) 




* As will be shown in a more general context in the next section, un- 
der favorable conditions the integral actually converges in the strong 
operator topology. 
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Hence X\. and Pk can be represented as differential rather than inte- 
gral operators. 

As promised, the continuous resolution of the identity makes it 
possible to reconstruct f G L 2 (IR S ) from its transform / G T\ 

f = If= [ dix(z)\xz)(Xz\f), (29) 

that is, 

f(x')= [ d^z) Xz (x')f(z). (30) 

J(£ a 

Thus in many respects the coherent states behave like a basis for 
L 2 (IR S ). But they differ from a basis in at least one important respect: 
They cannot all be linearly independent, since there are uncountably 
many of them and L 2 (IR S ) (and hence also J-) is separable. In partic- 
ular, the above reconstruction formula can be used to express Xz m 
terms of all the x™'s: 

Xz = / dfi(w)\xw){Xw\Xz)- (31) 

In fact, since entire functions are determined by their values on some 
discrete subsets V of(D s , we conclude that the corresponding subsets 
of coherent states {x z \ z G V} are already complete since for any 
function / orthogonal to them all, f(z) =0 for all z G V and hence 
/ = 0, which implies / = a.e. For example, if V is a regular lattice, a 
necessary and sufficient condition for completesness is that V contain 
at least one point in each Planck cell (Bargmann et al., [1971]), in 
the sense that the spacings Axk and Apk of the lattice coordinates 
Zk = Xk + ipk satisfy AxkApk < 2-kTi = 2tv. It is no accident that 
this looks like the uncertainty principle but with the inequality going 
"the wrong way." The exact coefficient of h is somewhat arbitrary 
and depends on one's definition of uncertainty; it is possible to de- 
fine measures of uncertainty other than the standard deviation. (In 
fact, a preferable — but less tractable — definition of uncertainty uses 
the notion of entropy, which involves all moments rather than just 
the second moment. See Bialynicki-Birula and Mycielski [1975] and 

Zakai [I960].) The intuitive explanation is that if / gets "sampled" 
at least once in every Planck cell, then it is uniquely determined since 
the uncertainty principle limits the amount of variation which can 
take place within such a cell. Hence the set of all coherent states 
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is overcomplete. We will see later that reconstruction formulas exist 
for some discrete subsystems of coherent states, which makes them 
as useful as the continuum of such states. This ability to synthesize 
continuous and discrete methods in a single representation, as well as 
to bridge quantum and classical concepts, is one more aspect of the 
appeal and mystery of these systems. 



1.3. Generalized Frames and Resolutions of Unity 

Let M be a set and fi be a measure on M (with an appropriate cr- 
algebra of measurable subsets) such that {M, /j,} is a tr-finite measure 
space. Let Ti be a Hilbert space and h m G Ti be a family of vectors 
indexed by m G M. 
Definition. The set 

n M = {h m \meM} (1) 

is a generalized frame in 7i if 

1. the map h: m i— > h m is weakly measurable, i.e. for each / G Ti 
the function f(m) = ( h m \ f ) is measurable, and 

2. there exist constants < A < B such that 

A\\ff<[ df,(m)\(h m \f)\ 2 <B\\ff V/GW. (2) 

7^m is a frame (see Young [1980] and Daubechies [1988a]) in the spe- 
cial case when M is countable and \x is the counting measure on M 
(i.e., it assigns to each subset of M the number of elements contained 
in it). In that case, the above condition becomes 

^ll/H 2 < E l(^|/)| 2 <S||/|| 2 V/GW. (3) 

We will henceforth drop the adjective "generalized" and simply speak 
of "frames." The above case where M is countable will be refered to 
as a discrete frame. 

If A = B, the frame Hm is called tight. The coherent states of the 
last section form a tight frame, with M = (E s ,<i / u(m) = dfi(z), h m = 
Xz and A = B = 1. 

Given a frame, let T be the map taking vectors in Ti to functions 
on M defined by 
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(Tf)(m) = (h m \f) = f(m), feH. (4) 

Then the frame condition states that Tf is square-integrable with 
respect to d/j, so that T defines a map 

T:U^ L 2 (dy), (5) 

with 

^ll/ll^r/III.^^SH/ll 2 . (6) 
The frame property can now be stated in operator form as 

AI < T*T < BI, (7) 
where I is the identity on Ti. In bra-ket notation, 

G = T*T= [ dn(m)\h m )(h m \, (8) 

J M 

where the integral is to be interpreted, initially, as converging in the 
weak operator topology, i.e. as a quadratic form. For a measurable 
subset N of M, write 

G(N) = [ dfi(m)\h m )(h m \. (9) 

Proposition 1.2. If the integral G(N) converges in the strong op- 
erator topology of H whenever N has finite measure, then so does the 
complete integral representing G = T*T. 

Proof*. Since M is cr-finite, we can choose an increasing sequence 
{M n } of sets of finite measure such that M = U n M n . Then the corre- 
sponding sequence of integrals G n forms a bounded (by G) increasing 
sequence of Hermitian operators, hence converges to G in the strong 
operator topology (see Halmos [1967], problem 94). I 

If the frame is tight, then G = AI and the above gives a resolution 
of unity after dividing by A. For non-tight frames, one generally has 
to do some work to obtain a resolution of unity. The frame condition 
means that G has a bounded inverse, with 

B~ X I < G~ x < A~ X I. (10) 



* I thank M. B. Ruskai for suggesting this proof. 
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Given a function g(m) in L 2 (dfi), we are interested in answering the 
following two questions: (a) Is g = Tf for some / G HP. (b) If so, 
then what is /? In other words, we want to: 

(a) Find the range 9?x C L 2 (d/j,) of the map T. 

(b) Find a left inverse £ of T, which enables us to reconstruct / from 
Tf by f = ST f. 

Both questions will be answered if we can explicitly compute G~ x . 
For let 

P = TG _1 T*: L 2 (c^) -> L 2 (c^). (11) 
Then it is easy to see that 

(a) P* = P 

(b) P 2 = P (12) 

(c) PT = T. 

It follows that P is the orthogonal projection onto the range of T, 

P: L 2 (dfi) ^3? T , (13) 

for if (7 = T/ for some / in 7i, then = PTf = Tf = g, and 
conversely if for some g we have Pg = g, then g = T(G~ l T* g) = Tf. 
Thus is a closed subspace of L 2 (dn) and a function g e L (d/j,) is 
in 3?t if and only if 

<7(m) = (P<7)(m) = (TG" 1 ^) (m) 
= (h m \G- 1 T*g) n 
= (TG~ 1 h m \g)L 2 (dfi) 



J 



e^(m') (TG-^h m ){m')g{m') (14) 



d//(m') (hm> \G 1 h m )g(m') 



M 



The function 



dfj,(m') (hm\G 1 h ml )g(m). 

M 



K(m,m') = {h m \G- l h ml ) (15) 
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therefore has a property similar to the Dirac 5-function with respect 
to the measure dfi, in that it reproduces functions in 9ft j-. But it differs 
from the 5-function in some important respects. For one thing, it is 
bounded by 

\K(m,m')\ = \(h m \G- 1 h' m )\ 

< WG- 1 ]] \\h m \\ \\h m ,\\ (16) 

< A _1 ||/i m || \\h m >\\ < oo 

for all m and m' . Furthermore, the "test functions" which K(m,m') 
reproduces form a Hilbert space and K(m, m') defines an integral 
operator, not merely a distribution, on 9?x- In the applications to 
relativistic quantum theory to be developed later, M will be a com- 
plexification of spacetime and K(m, m') will be holomorphic in m and 
antiholomorphic in m' . 

The Hilbert space 3?t and the associated function 
K(m, m') are an example of an important structure called a reprodu- 
cing-kernel Hilbert space (see Meschkowski [1962]), which is reviewed 
briefly in the next section. K{m, m') is called a reproducing kernel 
for3f? T . 

We can thus summarize our answer to the first question by saying 
that a function g G L 2 (dfi) belongs to the range of T if and only if it 
satisfies the consistency condition 

g(m)= dn(m')K(m,m')g(m'). (17) 
Jm 

Of course, this condition is only useful to the extent that we have 
information about the kernel K(m,m') or, equivalently, about the 
operator G~ x . The answer to our second question also depends on 
the knowledge of G~ l . For once we know that g = Tf for some / G 7i, 
then 

/ = G~ x Gf = G~ X T* (Tf) = G~ l T*g. (18) 
Thus the operator 

S = G~ 1 T*: L 2 (dy) (19) 
is a left inverse of T and we can reconstruct / by 
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f = Sg = G~ 1 T*Tf 
= G-'J dfj,(m)\hm){h m \f) 

= / dfi(m)G~ 1 \h m )g(m). 

JM 

This gives / as a linear combination of the vectors 

h m = G- X h m . (21) 

Note that 

/ dn{m)\h m ){h m \ = G~ 1 GG~ 1 = G~\ (22) 

JM 

therefore the set 

H M = {h m | m E M} (23) 

is also a frame, with frame constants < B~ x < A -1 . We will 
call Ti M the frame reciprocal to Hm- (In Daubechies [1988a], the 
corresponding discrete object is called the dual frame, but as we shall 
see below, it is actually a generalization of the concept of reciprocal 
basis; since the term "dual basis" has an entirely different meaning, 
we prefer "reciprocal frame" to avoid confusion.) 

The above reconstruction formula is equivalent to the resolutions 
of unity in terms of the pair 7Ym, 'H M of reciprocal frames: 

/ d^(m)\h m )(h m \ = I= [ d^m)\h m )(h m \. (24) 

JM JM 

Corollary 1.3. Under the assumptions of proposition 1.2, the above 
resolutions of unity converge in the strong operator topology of 7i. 
The proof is similar to that of proposition 1.2 and will not be given. 
The strong convergence of the resolutions of unity is important, since 
it means that the reconstruction formula is valid within 7i rather 
than just weakly. Application to / = hk for a fixed k G M gives 

hk = djj,{m) h m K(m,k), (25) 

JM 

which shows that the frame vectors h m are in general not linearly in- 
dependent. The consistency condition can be understood as requiring 
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the proposed function g(m) to respect the linear dependence of the 
frame vectors. In the special case when the frame vectors are lin- 
early independent, the frames Hm and H M both reduce to bases of 
H. If H is separable (which we assume it is), it follows that M must 
be countable, and without loss in generality we may assume that d\x 
is the counting measure on M (re-normalize the /i m 's if necessary). 
Then the above relation becomes 

hk = ^ h m K(m,k), (26) 

and linear independence requires that K be the Kronecker 6: K(m, k) 
— 5^- Thus when the /i m 's are linearly independent, Hm and H M 
reduce to a pair of reciprocal bases for H. The resolutions of unity 
become 

\h m )(h m \ = I= \h m )(hm\, (27) 
and we have the relation 



hk = h m (h m \h k ) = Yl hm 9mk (28) 

mEM ineM 



where 



gmk = (hm\h k ) = (h rn \ Gh k ) (29) 

is an infinite-dimensional version of the metric tensor, which mediates 
between covariant and contravariant vectors. (The operator G plays 
the role of a metric operator.) In this case, 3?t = L 2 (d/i) = £ 2 (M) and 
the consistency condition reduces to an identity. The reconstruction 
formula becomes the usual expression for / as a linear combination of 
the (reciprocal) basis vectors. If we further specialize to the case of a 
tight frame, then G = AI implies that 

(h m \h k ) = A5 k m and ( h m \ h k ) = A" 1 ^, (30) 

so Hm and Ti M become orthogonal bases. Requiring A = B = 1 
means that Hm = H M reduce to a single orthonormal basis. 

Returning to the general case, we may summarize our findings 
as follows: Hm, H m , K(m,m') and g(m,m') = (h m \Gh m i) are 
generalizations of the concepts of basis, reciprocal basis, Kronecker 
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delta and metric tensor to the infinite-dimensional case where, in ad- 
dition, the requirement of linear independence is dropped. The point 
is that the all-important reconstruction formula, which allows us to 
express any vector as a linear combination of the frame vectors, sur- 
vives under the additional (and obviously necessary) restriction that 
the consistency condition be obeyed. The useful concepts of orthogo- 
nal and orthonormal bases generalize to tight frames and frames with 
A = B = 1, respectively. We will call frames with A = B = 1 normal. 
Thus normal frames are nothing but resolutions of unity. 

Returning to the general situation, we must still supply a way of 
computing on which the entire construction above depends. In 
some of the examples to follow, G is actually a multiplication operator, 
so G~ l is easy to compute. If no such easy way exist, the following 
procedure may be used. From AI <G< BI it follows that 



~\{B -A)I<G- \{B + A)I < \{B - A)I. (31) 



Hence letting 



we have 



r B-A , 2 

7 and c=- -, 32 

B + A B + A' v ; 



-SI < I - cG < SI. (33) 
Since < 5 < 1 and c > 0, we can expand 

CO 

G' 1 = C [I - (I - cG)}' 1 = cJ2(I ~ cG) k (34) 

k=0 

and the series converges uniformly since 

\\I - cG\\ < 5 < 1. (35) 

The smaller 6, the faster the convergence. For a tight frame, 5 = 
and cG = /, so the series collapses to a single term G~ x = c. Then 
the consistency condition becomes 

g(m) = c d/i(m') (h m \h ml )g{m'), (36) 

J M 

and the reconstruction formula simplifies to 
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f = c d^(m)h m g(m). (37) 
Jm 

If < <5 <C 1, the frame is called snug (Daubechies [1988a]) and 
the above formulae merely represent the first terms in the expan- 
sions. However, it is found in practice that under certain conditions, 
this first term gives a good approximation to the reconstruction for 
snug frames. It appears to be an advantage to have much linear de- 
pendence among the frame vectors (precisely that which is impossible 

when dealing with bases!), so the transformed function f(m) is "over- 
sampled." For such frames, the oversampling seems to compensate for 
the truncation of the series. A good measure of the amount of linear 
dependence among the Jim's is the size of the orthogonal complement 
of the range of T, which is the null space Af(T*) of T* since 

ge3$ ^ (g\Tf) = VfeH 

(T*g\f) = VfeH (38) 
T*g = 0. 

Suppose we are given some function g(m) G L 2 (dfx) and apply 
the reconstruction formula to it blindly, without worrying whether 
the consistency condition is satisfied. That is, consider the vector 

/ = G~ x T*g (39) 

in 7i. Since g can be uniquely written as an orthogonal sum g = gi+g 2 
where gi G 3ft T and g 2 G ^ = A/"(T*), we find that 

/ = G~ x T*gx (40) 

where g\ does satisfy the consistency condition. This means that 
applying the reconstruction formula to an arbitrary g G L 2 (dfi) results 
in a least-squares approximation f to a reconstruction, in the sense 
that the "error" g 2 = g — gi = g — Tf has a minimal norm in L 2 (dfi). 

For example, suppose we only have information about f(m) for 
m in some subset V of M. Let 

9{m) = \h m ) (41) 
[0 if m f r. 

Then g does not belong to 3?t 7 i n general, and fi = G~ x T*g is the 
least-squares approximation to the (unknown) vector / in view of our 
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ignorance. A similar argument applies if M is discrete and / must be 
reconstructed from Tf by numerical methods. Then we must confine 
ourselves to a finite subset Y of M. The above procedure then gives 
a least-squares approximation to / by truncating the reconstruction 
formula to a finite sum over Y. 

A final note: The usual argument in favor of using bases rather 
than overcomplete sets of vectors is that one desires a unique repre- 
sentation of functions as linear combinations of basis elements. When 
a frame is not a basis, i.e. when the frame vectors are linearly de- 
pendent, this uniqueness is indeed lost since we may add an arbitrary 
function e(m) G 3t^ to f(m) in the reconstruction formula without 
changing / (3?^ 7^ {0} since the frame vectors are dependent). How- 
ever, there is still a unique admissible coefficient function, i.e. one 
satisfying the consistency condition. Moreover, as we shall see, it 
usually happens in practice that the set M, in addition to being a 
measure space, has some further structure, and the reproducing ker- 
nel K(m,m') preserves this structure. For example, M could be a 
topological space and K be continuous on M x M, or M could be a 
differentiable manifold and K be differentiable, or (as will happen in 
our treatment of relativistic quantum mechanics) M could be a com- 
plex manifold and K be holomorphic. Furthermore, K could exhibit 
a certain boundary- or asymptotic behavior. In such cases, these 
properties are inherited by all the functions f(m) in 9ft j-, and then of 
all possible coefficient functions for a given /gH, there is only one 
which exhibits the appropriate behavior. In this sense, uniqueness is 
restored. We will refer to frames with such additional properties as 
continuous, differentiable, holomorphic, etc. 

In addition to properties such as differentiability or holomorphy, 
the kernels K we will encounter will usually have certain invariance 
properties with respect to some group of transformations acting on 
M. This, too, will induce a corresponding structure on the function 
space 3?t- 



1.4. Reproducing— Kernel Hilbert Spaces 

The function K(m, m') of section 1.3 is an example of a very general 
structure called a reproducing kernel, which we now review briefly 
since it, too, will play an important role in the chapters to come. 
The reader interested in learning more about this fascinating subject 
may consult Aronszajn [1950], Bergman [1950], Meschkowski [1962] 
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or Hille [1972]. 

Suppose we begin with an arbitrary set M and a set of functions 
g(m) on M which forms a Hilbert space T under some inner product 
(• | •). In section 1.3, M happened to a measure space, T was 3?t 
and the inner product was that of L 2 (<i / u). But it is important to 
keep in mind that the exact form of the inner product in JF does 
not need to be specified, as far as the general theory of reproducing 
kernels is concerned. Suppose we are given a complex-valued function 
K(m,m') on M x M with the following two properties: 

1. For every m G M, the function K m (m') = K(m',m) belongs to 
T. 

2. For every m G M and every g G J 7 , we have 

g(m) = (K m \g). (1) 

Then T is called a reproducing-kernel Hilbert space and 
K(m, m') is called its reproducing kernel. Some properties of K fol- 
low immediately from the definition. For example, since K m G J 7 , 
property (2) implies that 

K(m\m) = K m {m') = {K m .\ K m ). (2) 
Thus K must satisfy 

(a) K(m,m') = K{m ,m) 

(b) K(m,m) = \\K ni \\ 2 >0 Vm G M (3) 

(c) |K(m,m')| < \\K m \\ H^m'll- 

One of the most important and useful aspects of reproducing- 
kernel Hilbert spaces is the fact that the kernel function itself virtually 
generates the whole structure. For example, the function L(m) = 
\\K m \\ dominates every g(m) in T in the sense that 

\g{m)\ = \ (K m \g) \ < \\K m \\ \\g\\ = \\g\\L(m) (4) 

by the Schwarz inequality. In particular, all the functions in JF inherit 
the boundednes and growth properties of K. UK has singularities, 
then some functions in T have similar singularities. 
From the above it follows that for fixed mo G M, 



sup \g(m )\ = L(m ), 
Nl=i 



(5) 
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the supremum being attained by gm ( m ) = K mo (m)/L(mo), and this 
is, up to an overall phase factor, the only function which attains the 
supremum. This fact has an important interpretation when M is the 
classical phase space of some system, T represents the Hilbert space 
of the corresponding quantum mechanical system and \g(m)\ 2 is the 
probability density in the quantum mechanical state g of finding the 
system in the classical state to. The above inequality then shows that 
the state which maximizes the probability of being at too is uniquely 
determined (up to a phase factor) as g mo . In other words, g mo is a 
wave packet which is in some sense optimally localized at too in phase 
space. 

Another example of how the kernel function K embodies the prop- 
erties of the entire Hilbert space T is the following: If the basic set M 
has some additional structure, such as being a measure space (as it 
was in the setting of frame theory) or a topological space or a C fc , C°°, 
real-analytic or complex manifold, and if K(m,m'), as a function of 
to, preserves this structure (that is, K(m,m') is measurable, contin- 
uous, C fc ,C°°, real-analytic or holomorphic in m, respectively), then 
every function g(m) in T has the same property. 

The question arises: If we are given a Hilbert space JF whose 
elements are all functions on M, how do we know whether this space 
posesses a reproducing kernel? Clearly a necessary condition for K 
to exist is that 

| /(m) | = \(K m \f) \ < \\K m \\\\f\\ (6) 

for all / G T and all to G M. But this means that for every fixed to, 
the map E m : T — > (D defined by 

Em(f) = f{m) (7) 

is a bounded linear functional on T . E m is called the evaluation map 
at m. By the Riesz representation theorem, every bounded linear 
functional E on JF must have the form E(f) = ( e | / ) for a unique 
vector e G T . Hence there exists a unique e m G T such that 

f(m) = {e m \f) (8) 

for all / G T. Since also /(to) = (K m | /), we conclude that e m = 
K m . Thus it follows that given only T such that all the evalua- 
tion maps E m are bounded (not necessarily uniformly in m), we can 
construct a reproducing kernel by 
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K{m,m') = (e m \ e m > ). (9) 

That is, T posesses a reproducing kernel if and only if all the evalua- 
tion maps are bounded. 

In all of the applications we will encounter, the set M will have 
a structure beyond those mentioned so far: it will be a Lie group G 
or a homogeneous space of G. That is, each g e G acts on M as an 
invertible transformation preserving whatever other structure M may 
have such as continuity, differentiability, etc, and these tranformations 
form the group G under composition. Let us denote the action of g 
on M by m h mg (i.e., G acts from the right). Then the operator 

9 (10) 
(T g f)(m) = f(mg) 

gives a representation of G on J 7 , since T s T g / = T gg >. From /(m) = 
( K m | / ) it now follows that 



K mg = T* g K m , (11) 

hence 



K(m'g, mg) = ( K m , g \K mg ) = { K m , \ T g T* K m ). (12) 

Therefore the reproducing kernel K(m', m) is invariant under the ac- 
tion of G if and only if all the operators T g are unitary, i.e. the 
representation g i— > T g is unitary. 

The group G usually appears in applications as a natural set of 
operations such as translations in space and time (evolution) , changes 
of reference frame or coordinate system, dilations and frequency shifts 
(especially useful in signal analysis), etc. Invariance under G then 
means that the transformed objects (wave functions, signals) form a 
description of the system equivalent to the original one, i.e. that the 
transformation changes nothing of physical significance. 

Finally, let us note that the concept of generalized frame, as 
defined in the previous section, is a special case of a reproducing- 
kernel Hilbert space. For given such a frame {h m }, the function 
K(m',m) = ( h m i \ G~ 1 h m ) is a reproducing kernel for the space 3?t 
since (a): 

K m {m') = Kim!, m) = (T(G~ 1 h m ) ) (m') (13) 
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shows that K m = T(G~ 1 h m ) belongs to 9?t, and (b): 

f(m)= [ d f i(m')K(m,m')f(m') = (K m \f)s SiT . (14) 
Jm 



1.5. Windowed Fourier Transforms 

The coherent states of section 2 form a (holomorphic) frame. I now 
want to give some other examples of frames, in order to develop a 
better feeling for this concept. The frames to be constructed in this 
section turn out to be closely related to the coherent states, but have a 
distinct "signal processing" flavor which will lend some further depth 
to our understanding of phase-space localization. 

For simplicity, we begin with the study of functions f(t) of a single 
real variable which, for motivational purposes, will be thought of as 
"time." Although in the model to be built below / is real-valued, we 
allow it to be complex-valued since our eventual applications will be 
quantum-mechanical. The same goes for the window function h. The 
extension to functions of several variables, such as wave functions or 
physical fields in spacetime, is straightforward and will be indicated 
later. We think of f(t) as a "time-signal," such as the voltage going 
into a speaker or the pressure on an eardrum. We are interested in 
the frequency content of this signal. The standard thing to do is to 
find its Fourier coefficients (if / is periodic) or its Fourier transform 
(if it is not periodic). But if / represents, say, a symphony, this 
approach is completely inappropriate. We are not interested in the 
total amplitude 

/oo 
dte 2 ^f(t) (1) 
-oo 

in / of each frequency v. For one thing, both we and the musicians 
would be long gone before we got to enjoy the music. Moreover, 
the musical content of the signal, though coded into /, would be as 
inaccessible to us as it is in f(t) itself. Rather, we want to analyze the 
frequency content of / in reai time. At each instant t = s we hear a 
"spectrum" f(u, s). To accomplish this, let us make a simple (though 
not very realistic) linear model of our auditory system. We will speak 
of the "ear," though actually the frequency analysis appears to be 
performed partly by the nervous system as well (Roederer [1975]). 
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The ear's output at time s depends on the input fit) for t in some 
interval s — r < t < s, where r is a lag-time characteristic of the ear. 
In analyzing f(t) in this interval, the ear may give different weights 
to different parts of /(£). Thus the signal to be analyzed for output 
at time s may be modeled as 



f s (t) = h(t-s)f(t) (2) 

where h(t — s) is the weight assigned to f(t). (As noted above, we are 
allowing h to be complex-valued for future applications, though here 
it should be real-valued; the bar means complex conjugation.) The 
function h(t) is characteristic of the ear, with support in the interval 
— t < t < 0. Such functions are known in communication theory as 
windows. 

Having "localized" the signal around time s, we now analyze its 
frequency content by taking the Fourier transform: 

/oo 
dte^W^jfit). (3) 
-oo 

This function, representing our "dynamical spectrum," is called a 
windowed Fourier transform of / (Daubechies [1988a]). If the window 
is flat (h(t) = 1), f(v, s) reduces to the ordinary Fourier transform. 
On the other hand, if f(t) is a "unit impulse" at t = 0, i.e. /(£) = 

S(t), then f(u,s) = h(—s). Hence h(—s) is the "impulse response" 
(Papoulis [1962]) of the ear. Note that 

/oo 
dtW=s)f(t) (4) 
-oo 

is nothing but the convolution of / with the impulse response. Thus 
the windowed Fourier transform is a marriage between the Fourier 
transform and convolution. Letting 

K, s {t) = e- 2 ™ t h{t-s), (5) 

we have 

f(v,s) = {h v , a \f), (6) 

the inner product being in L 2 (IR). As our notation implies, we want 
to make a frame indexed by the set 
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M = {(z/, s)\u,sE H} ~ IR 2 , (7) 

that is, the time-frequency plane. M corresponds to the "phase space" 
in quantum mechanics in the sense that if t were the position coordi- 
nate, then v would be the wavenumber and lutiv would be the mo- 
mentum. Since we expect all times and all frequencies to be equally 
important, let us guess that an appropriate measure on M is the 
Lebesgue measure, d[i{y, s) = dsdu. Now 

/>, a) = (£./)» (8) 

where h s (t) = h(t — s) is the translated window and " denotes the 
Fourier transform. Thus for "nice" signals (say, in the space of 
Schwartz test functions), Plancherel's theorem gives 



IR 2 



ds dv | f(v, s) | 2 = J ds J dv \ (/i s /)~(z/) 



d, I dl //,(/)/S/) 2 



= J dt | /(£) | 2 J ds\h(t 



s 



2 



where both norms are those of L 2 (IR). This shows that the fam- 
ily of vectors h U}S is indeed a tight frame, with frame constants 
A = B = \\h\\ 2 . Now h„ jS = exp(— 2irivt)h s (t) is just a translation of 
h s in frequency, that is, 

h v , a (i/) = h 8 (u'-u). (10) 

Thus if the Fourier transform h{v) of the basic window is concentrated 
near v = (which, among other things, means that h(t) must be 
fairly smooth, since any discontinuities or sharp edges introduce high- 
frequency components), then that of h VyS is concentrated near v' = v. 
In other words, h v ^ s is actually a window in phase space or time- 
frequency. The fact that these windows form a tight frame means 
that signals may be equally well represented in time-frequency as in 
time. The resolution of unity is 
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\\h\\- 2 [ dsdv\h v ,s){h v ,s \ =1, (11) 
Jtr 2 

and if the consistency condition is satisfied the reconstruction formula 
gives 

f(t) = \\h\\- 2 [ d8dve- 2iri "h(t-8)fc,8). (12) 

As in section 1.2, we denote by Hm the frame formed by the /i^s's. 

Incidentally, the windowed Fourier transform is quite symmetrical 
with respect to the interchange of time and frequency. It can be re- 
written as 

/>, a) = {h s /) » 

= (k * /) iy) (13) 

= e - 2 ™ s J dv' e 2 ™' s h(v< - v )f(i/), 

where h now plays the role of a window in frequency used to localize 
/. As will be seen in the next chapter, the reason for this symmetry is 
that windowed Fourier transforms, like the canonical coherent states, 
are closely related to the Weyl-Heisenberg group, which treats time 
and frequency in a symmetrical fashion. (This is rooted in symplectic 
geometry.) 

The /ij, iS 's are highly redundant. We want to find discrete subsets 
of them which still form a frame, that is we want discrete subframes. 
The following construction is taken from Kaiser [1978c, 1984a]. Let 
T > be a fixed time interval and suppose we "sample" the output 
signal f(v,s) only at times s = nT where n is an integer. To dis- 
cretize the frequency as well, note that the localized signal f n T(f) = 
h n T(t)f(t) has compact support in the interval nT — r < t < nT, 
hence we can expand it in a Fourier series 

fnT(t) = J2e- 27TtmFt c mn (14) 



where F = 1/r and 
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nnT 

Cmn=F dte 2 ^ mFt f nT (t) 
JnT-T 

= F dte 27TimFt h(t-nT)f(t) (15) 

JnT-T 

= Ff(mF,nT). 

The only problem with this representation of f n T is that it only holds 
in the interval nT — r < t < nT, since f n T vanishes outside this 
interval while the Fourier series is periodic. To force equality at all 
times, we multiply both sides by rh n T(t): 

rh nT (t)f nT (t) = r\h nT (t)\ 2 f(t) 

= Y^e-^ imFt h nT {t)f{mF,nT) 

( 16 ) 

= ^2 hmF,nT(t) ( h^ m F,nT \ f )• 
m 

Attempting to recover the entire signal rather than just pieces of it, 
we now sum both sides with respect to n: 

tQT \h(t - nT)\ 2 )f(t) = Y^ h mF,nT{t){hnT,mF \ / )• (17) 
n n.m 

Recovery of fit) is possible provided that the sum on the left converges 
to a function 

g(t) = rJ2\Ht-nT)\ 2 (18) 

n 

which is bounded above and below by positive constants: 

< A < g(t) < B. (19) 

In that case, we have 

£ \(h mF , nT \f)\ 2 = f dtg(t)\f(t)\ 2 (20) 
and hence the subset 
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TiY = {hmF,nT\m,ne7L} (21) 

forms a discrete subframe of Hm with frame constants A and B. 
This subframe is in general not tight, and the operator G is just 
multiplication by g(t), so finding G~ x presents no problem in this 
case. The reconstruction formula is 

f(t)=g(t)- 1 J2 h mF , nT (t)f(mF,nT). (22) 

n,m£Z5 

We are therefore able to recover the signal by "sampling" it in phase 
space at time intervals At = T and frequency intervals Au = F. 
What about the uncertainty principle? It is hiding in the condition 
that the discrete subset h m F n T forms a frame! For we cannot satisfy 
g(t) > A > unless the supports of h(t) and h(t — T) overlap, which 
implies that T < r, or AtAu = T/r < 1. In quantum mechanics, the 
radian frequency is related to the energy by E = ItiUv, so the above 
condition is 

AtAE<2nh. (23) 

This looks like the uncertainty principle going "the wrong way." The 
intuitive explanation for this has been discussed at the end of section 
1.3. 

Notice that the closer we choose T to r, the more difficult it is 
for the window function to be smooth. In the limiting case T = r, 
h(t) must be discontinuous if the above frame condition is to be 

obeyed. As noted earlier, this means that its Fourier transform h(u) 
can no longer be concentrated near v = 0, so the frequency reso- 
lution of the samples suffers. In concrete terms, this means that 
whereas for "nice" windows h(t) we may hope to get a good approx- 
imation to the reconstruction formula by truncating the double sum 
after an appropriate finite number of terms, this can no longer be 
expected when T — > r. In other words, it pays to oversample! "Ap- 
propriate" here means that we cover most of the area in the time- 
frequency plane where / lives. Clearly, if the sampling is done only 
for | n | < AT, we cannot expect to recover f(t) outside the interval 

—NT — t < t < NT. If h(u) is nicely peaked around v — 0, say with a 
spread of Av, and the signal fit) is (approximately) "band-limited," 

so that f{v) m for \v\ > W, then we can expect to get a good 
approximation to f(t) by truncating the sum with m < M, where 
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M is chosen to satisfy MF > W + Av. On the other hand, we may 
actually not be interested in recovering the exact original signal f(t). 
If we are only interested in a particular time interval and a particular 
frequency interval (say, to eliminate some high-frequency noise), then 
the appropriate truncation would include only an area in the time- 
frequency plane which is slightly larger than our area of interest. If 
we sample a given signal / only in a finite subset V of our lattice (as 
we are bound to do in practice) and then apply the reconstruction 
formula, truncating the sum by restricting it to T, then the result is 
a least-squares approximation fi to the original signal. 

On a philosophical note, suppose we are given an arbitrary signal 
without any idea of the type of information it carries. The choice of 
a window h(t) then defines a scale in time, given by r, and it is this 
scale that distinguishes what will be perceived as frequency and what 
as time. Thus, variations of f(t) in time intervals much smaller than 
r are reflected in the z/-behavior of /, while variations at scales much 
larger than r survive in the s-dependence of /. What an elephant 
perceives as a tone may appear as a rhythm to a mouse. 

Finally, I wish to compare the above reconstruction with the 
Nyquist/Shannon sampling theorem (Papoulis [1962]), which gives 

a reconstruction for band-limited signals ( f(v) ps for \v\ > W) 
in terms of samples of f(t) taken at times t = n/2W for integer n. 
Note, first of all, that in the time-frequency reconstruction formula 
above, it was not necessary to assume that /(£) is band-limited. In 
fact, the formula applies to all square-integrable signals. But suppose 
that f(t) is band-limited as above, and choose a window h(t) such 

that h(v) is concentrated around an interval of width Av about the 
origin, with Av < W. Then we expect f(v, s) ~ for \v\ > W + Av 
. In order to reduce the double sum in our reconstruction formula to 
a single sum over n as in the Nyquist theorem, choose F = W + Av. 
Then f(mF,s) ~ whenever m ^ 0. To apply the reconstruction 
formula, we must still choose a time interval T < r. By the uncer- 
tainty principle, rAv > 1/2. The above condition on Av therefore 
implies that r > 1/2W. It would thus seem that we could get away 
with a slightly larger sampling interval T than the Nyquist interval 
T N = 1/2W. Our reconstruction formula reduces to 

f(t) » git)- 1 Kt - nT) /(0, nT). (24) 

n 

The smaller the ratio Av/W, the better this approximation is likely 
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to be. But a small Au means a large r, hence the samples /(0, nT) 
are smeared over a large time interval. 



1.6. Wavelet Transforms 

The frame vectors for windowed Fourier transforms were the wave 
packets 

h„, a (t) = e- 2 ™ t h(t-s). (1) 

The basic window function h(t) was assumed to vanish outside of the 
interval — r < t < and to be reasonably smooth with no steep slopes, 
so that its Fourier transform h(v) was also centered in a small interval 

about the origin. Of course, since h(t) has compact support, h(u) is 
the restriction to IR of an entire function and hence cannot vanish on 
any interval, much less be of compact support. The above statement 
simply means that h(y) decays rapidly outside of a small interval con- 
taining the origin. At any rate, the factor exp(— 2iriut) amounted to a 
translation of the window in frequency, so that h ViS was a "window" in 
the time-frequency plane centered about (v,s). Hence the frequency 
components of fit) were picked out by means of rigidly translating the 
basic window in both time and frequency. (It is for this reason that 
the windowed Fourier transform is associated to the Weyl-Heisenberg 
group, which is exactly the group of all translations in phase space 
amended with the multiplication by phase factors necessary to close 
the Lie algebra, as explained in chapter 3.) Consequently, h VjS has 
the same width r for all frequencies, and the number of wavelengths 
admitted for analysis is vr. For low frequencies with !/r <C 1 this is in- 
adequate since we cannot gain any meaningful frequency information 
by looking at a small fraction of a wavelength. For high frequencies 
with vr >> 1, too many wavelengths are admitted. For such waves, 
a time-interval of duration r seems infinite, thus negating the sense 
of "locality" which the windowed Fourier transform was designed to 
achieve in the first place. This deficiency is remedied by the wavelet 
transform. The window h(t), called the basic wavelet, is now scaled 
to accomodate waves of different frequencies. That is, for a ^ let 

K, s {t) = H~ 1/2 h . (2) 

The factor lal -1 / 2 is included so that 
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/oo 
dt\h a , s {t)\ 2 = \\h\\ 2 . (3) 
-oo 

The necessity of using negative as well as positive values of a will 
become clear as we go along. It will also turn out that h will need to 
satisfy a technical condition. Again, we think of both h and / as real 
but allow them to be complex. The wavelet transform is now defined 
by 



/(a, s) = (H* a \f) = jT dt lal^h (t-?) f{t). (4) 

Before proceeding any further, let us see how the wavelet trans- 
form localizes signals in the time-frequency plane. The localization 
in time is clear: If we assume that h(t) is concentrated near t = 
(though it will no longer be convenient to assume that h has com- 
pact support), then /(a, s) is a weighted average of fit) around t = s 
(though the weight function need not be positive, and in general may 
even be complex). To analyze the frequency localization, we again 

want to express / in terms of the Fourier transforms of h and /. This 
is possible because, like the windowed Fourier transform, the wavelet 
transform involves rigid time-translations of the window, resulting in 
a convolution-like expression. The "impulse response" is now (setting 

/(*) = *(*)) 



g a ( 8 ) = lal'^hi-s/a), (5) 



and we have 



fia,s)= (g a */)(*) = (g a fY(s), (6) 

with 

/oo 
dte 2 ^ vt \a\- 1/2 hi-t/a) = \a\ l/2 hiav). (7) 
-oo 

Later we will see that discrete tight frames can be obtained with cer- 
tain choices of h(t) whose Fourier transforms have compact support 
in a frequency interval interval a < v < (3. Such functions (or, rather, 
the operations of convolutiong with them) are called bandpass filters 
in communication theory, since the only frequency components in fit) 
to survive are those in the "band" [a,/3]. Then the above expression 
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shows that /(a, s) depends only on the frequency component of f(t) 
in the band a/a < v < (3/ a (if a > 0) or (3/a < v < a/ a (if a < 0). 
Thus frequency localization is achieved by dilations rather than trans- 
lations in frequency space, in contrast to the windowed Fourier trans- 
form. At least from the point of view of audio signals, this actually 
seems preferable since it appears to be frequency ratios, rather than 
frequency differences, which carry meaning. For example, going up 
an octave is achieved by doubling the frequency. (However, frequency 
differences do play a role in connection with beats and also in certain 
non-linear phenomena such as difference tones; see Roederer [1975].) 
Let us now try to make a continuous frame out of the vectors h ajS . 
This time the index set is 



M = {(a,s)|a^0,s G IR} ^ IR* x IR (8) 

where IR* denotes the group of non-zero real numbers under multi- 
plication. M is the affine group of translations and dilations of the 
real line, t' = at + s, and this fact will be recognized as being very 
important in chapter 3. But for the present we use a more pedestrian 
approach to obtain the central results. This will make the power and 
elegance of the group-theoretic approach to be introduced later stand 
out and be appreciated all the more. At this point we only make the 
safe assumption that the measure dp on M is invariant under time 
translations, i.e. that 

d/ji(a,s) = p(a)dads (9) 

where p(a) is an as yet undetermined density on IR*. Then, using 
Plancherel's theorem, 

/ dfj,(a,s)\ /(a, s)| 2 =/ p(a) da ds | (g a fY(s) | 2 
Jm Jm 

= / p(a)da / du\g a (u)\ 2 \f(u)\ 2 

= / p(a)da / dv \a\ \ h{ai>) \ 2 | f(y) \ 2 
Jm* Jm 

/oo 
duH(u)\f(u)\ 2 
-oo 

= (f I -^/)l 2 (IR), 

(10) 
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where 



H(u) 



p(a) ■ | a | da | h(av) 



I 



OO 



ada 



p(a) | h(au) | 2 + p{— a) \ h(—au) 



(11) 



The frame condition is therefore A < H{u) < B for some positive 
constants A and B. To see its implications, we analyze the cases 
v > and v < separately. If v > 0, let £ = av. Then 



fT(iz) = z," 2 ^ idi \p{ijv) I a(£) | 2 + ,(-£/!/) | h(-0 | 2 ] 
If z/ < 0, let £ = -ai/. Then 



(12) 



H(u) = I ^ 

10 



p(-tM\h(-t)\ 2 + p(t/v)\h(t) 



(13) 



giving the same expression. Therefore the frame condition requires 
that 



p(Z/v)=0((Z/v)- 2 ) as £/z/^±oo, 



(14) 



unless h(u) vanishes in a neighborhood of the origin. Note that the 
above expression for H(v) shows that if both p(a) and h(u) vanish 
for negative arguments then H{v) =0 and no frame exists. Hence to 
support general complex- valued windows (such as bandpass filters for 
a positive-frequency band), it is necessary to include negative as well 
as positive scale factors a. 

The general case, therefore, is that we get a (generalized) frame 
whenever p(a) and h(t) are chosen such that < A < H(v) < B is 
satisfied. The "metric operator" G and its inverse are given in terms 
of Fourier transforms by 



Gf=(HfY and G- 1 f=(H- 1 jy. 



(15) 



Since G is no longer a multiplication operator in the time domain 
(as it was in the case of the discrete frame we constructed from the 
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windowed Fourier transforms), the action of G~ x is more complicated. 
It is preferable, therefore, to specialize to tight frames. This requires 
that H{y) be constant, so the asymptotic conditions on p reduce to 
the requirement that p be piecewise continuous: 



Pip) = 



c + /a 2 , if a > 
c~/a 2 , if a < 



(16) 



where c + and c are non- negative constants (not both zero). Then 
for v > 0, 



H(u) = 
and for v < 0, 

H(u) = 



c + |M0r + c"IM-0 



1 2 + c+ | /*(-£) | 2 



(17) 



(18) 



Thus iy(^) = A = B requires either that | h(£) | = | h(—£) \ (which 
holds if h(t) is real) or that c 



+ 



Since we want to accomodate 



complex wavelets, we assume the latter condition. Then we have 



A = B = c H 



We have therefore arrived at the measure 



H0\ 2 - 



(19) 



dp(a, s) = c + dads/a 2 (20) 

for tight frames, which coincides with the measure suggested by group 
theory (see chapter 3). In addition, we have found that the basic 
wavelet h must have the property that 

c h = J^^\h(0\ 2 <oo. (21) 

In that case, h(t) is said to be admissible. This condition is also a spe- 
cial case of a group-theoretic result, namely that we are dealing with 
a square-integrable representation of the appropriate group (in this 
case, the affine group IR* x H). To summarize, we have constructed 
a continuous tight frame of wavelets h ayS provided the basic wavelet 
is admissible. The corresponding resolution of unity is 
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c h 



f dads . , . . , 

/ — 2-| V-XV-I =/ - ( 22 ) 

</IR*xIR a 



The associated reproducing-kernel Hilbert space 3?^ is the space of 
functions (Kf)(a,s) = (h ajS \f) = f(a,s) depending on the scale 
parameter a as well as the time coordinate s. As a — > 0, /i a>s becomes 
peaked around t = s and 

;~h v V(s) (23) 



where c = f h(u) du. The transformed signal / is a smoothed-out 
version of / and a serves as a resolution parameter. 

Ultimately, all computations involve a finite number of opera- 
tions, hence as a first step it would be helpful to construct a discrete 
subframe of our continuous frame. Toward this end, choose a funda- 
mental scale parameter a > 1 and a fundamental time shift b > 0. 
We will consider the discrete subset of dilations and translations 

D = {{a m ,na m b) \m,n G 7Z} C R* x JR. (24) 

Note that since a m > for all m, only positive dilations are included in 
D, contrary to the lesson we have learned above. This will be remedied 
later by considering h(t) along with h(t). Also, D is not a subgroup 
of IR* x IR, as can be easily checked. The wavelets parametrized by 
D are 

h mn = a-™/ 2 h ( *~ o n jT & ) = a-™/ 2 h(a- m t - nb). (25) 

To see that this is exactly what is desired, suppose h(u) is con- 
centrated on an interval around v = F (i.e., h is a band-pass filter). 

Then h mn is concentrated around v = F/a m . For given integer m, 
the "samples" 



fmn=(hmn\f), Tl G 7Z (26) 

therefore represent (in discrete "time" n) the behavior of that part of 
the signal /(£) with frequencies near F/a m . If m ^> 1, f mn will vary 
slowly with n, and if m —1, it will vary rapidly with n (if f(t) has 
frequency components with v ~ F/a m ). Now the time-samples are 
separated by the interval A m t — a m b, so the sampling rate 
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R m = l/a m b (27) 

is automatically adjusted to the frequency range v ~ F/a m of the 
output signal {/ mn \n G 7Z}: The high-frequency components get 
sampled proportionately more often. This is an example of a topic in 
mathematics which has recently attracted intense activity under the 
banner of multiscale analysis (Mallat [1987], Meyer [1986]) and which 
is in fact closely related to the subject of wavelet transforms. 

Returning to the construction of a discrete frame, Parseval's for- 
mula gives 



fmn ( h mn | / ) ( h rnn | / 



/oo 
-oo 



dv hmniy) f(u) 



(28) 



where k mn (u) = h mn (u) is the Fourier transform of /i mn (^), 

kmn(v) = a m/2 exp(2nina rn bv) h(a m u). (29) 
We now assume that k(u) = h(u) vanishes outside the interval 

I = {F/a < v < Fa}, (30) 

where F > is some fixed frequency to be determined below. The 
width of the "band" Iq is Wq = (a — a~ 1 )F. Therefore the function 

k(a rn is)f(is) is supported on the compact interval 

I m = [F/a m , F/a™- 1 ] (31) 

of width W m = Wo/a m , and we can expand it in a Fourier series in 
that interval: 

k{a™v) />) = J2 e M^inis/W m ) c mn , (32) 

n 

where 

/oo 
dv exp(-27rmi//W m ) k(a m u) }{y). (33) 
-oo 

Comparing this with 
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/oo 
dua m/2 exp(-2irivnba m ) k(a m v)f(v) (34) 
-oo 



suggests that we choose F so that W m = l/a m 6, which gives 



(35) 



(a 2 - 1)6 
and 

Cmn = Q> ^ bfmn- (36) 

The Fourier series representation above only holds in the interval 
I m , since the left-hand side vanishes outside this interval while the 
right-hand side is periodic. To get equality for all frequencies and 
reconstruct /(£), multiply both sides by k{a m v) and sum over m: 

[ I k(a m v) I 2 ] fty) = bJ2 am/2 ex P (^ina m bv) k(a m u) f mn 

\ m / m,n 

b ^ ^ k mn {y) fmn ■ 

m,n 

(37) 

To have a frame we would need the series on the left-hand side to 
converge to a function x + ( u ) with 

0< A<x + (^) = J2 \Ha m v)\ 2 <B (38) 

m 

for some constants A and B. But this is a priori impossible, since 
h(u) is supported on an interval of positive frequencies and a m > 0, 
so x + ( iy ) vanishes for v < 0. However, we can choose h(t), a and b 
such that x + ( iy ) satisfies the frame condition for v > 0. Negative fre- 
quencies will be taken care of by starting with the complex-conjugate 
of the original wavelet. We adopt the notation 



h + (t) = h(t), h-(t) = h(t). (39) 
Then the Fourier transforms /c ± of are related by 



hT(v) = k+(-u), 



(40) 
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hence k (u) is supported on —Iq = [—-Fa, —F/a\. A similar argument 
to the above gives 

PT \k~{a m v)A f{v) = bY,k- n {v)fmn (41) 

\ m / m,n 

and 

X"M = E IMa"V)| 2 = X + (-^)- (42) 
Hence if x + satisfies the frame condition for v > 0, then 

0<A<x + (^) + X-(^)<5 (43) 

for all z/ 7^ 0. Since {0} has zero measure in frequency space, the 
frame condition is satisfied by the joint set of vectors 

< = K, k- n \m,ne2L}. (44) 
The metric operator 

G =E E \ ke mn)(k* mn \ (45) 

is given by 

(G/)(t) = ((x + + X-)/)-(t) (46) 

and satisfies the frame condition < AI < G < B. Since G is no 
longer a multiplication operator in the time domain (as was the case 
with the discrete frame connected to the windowed Fourier trans- 
form), the recovery of signals would be greatly simplified if the frame 
was tight. The following construction is borrowed from Daubechies 
[1988a]. Let F = a/ (a 2 — 1)6 as above and let k be any non-negative 
integer or k = oo. Choose a real-valued function r\ £ C fc (IR) (i.e., r\ 
is k times continuously differentiate) such that 

^ = {1/2 for £ > 1. ^ 

(Such functions are easily constructed; they are used in differential 
geometry, for example, to make partitions of unity; see Warner [1971].) 
Define h(t) through its Fourier transform k + (u) by 
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sin 



cos 



'I ^F-F/a 



for v < F 
for v > F. 



(48) 



Note that k+(u) is C k since the derivatives of rj(x) up to order 
k all vanish at x = and x = n/2. This means that the wavelets in 
the frame we are about to construct are all C k . Also, k + vanishes 
outside the interval Iq = [F/a, Fa]. The width of its support is 
Wo = (a — a _1 )F, and for each frequency v > there is a unique 
integer M such that F/a < a M v < F, hence also F < a M+1 v < aF. 
Therefore, for v > 0, 



X (y) = sin 



a 



+ cos' 



V 



V 



aF -F 



= 1. (49) 



Thus 



for v < 

1 forf > 0, 



(50) 



i.e., x + ( u ) is the indicator function for the set of positive numbers. 
It follows that x~(z^) is the indicator function for the negative reals, 
and 



X + + X =1 a.e. (51) 
This choice of k + and k~ = k+ gives us a tight frame, 

E E l^mnHklnl =1. (52) 
e=± m,nE7L 

This frame is not a basis; if it were, it would have to be an or- 
thonormal basis since it is a normal frame, hence the reproducing 
kernel would have to be diagonal. But 

K(e, m, n; e', m' , n') = ( k e mn \ k e m , n , ) (53) 

does not vanish for e' = e, n' = n and m' = m ± 1, due to the overlap 
of wavelets with adjacent scales. However, it is possible to construct 
orthonormal bases of wavelets which, in addition, have some other 
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surprising and remarkable properties. For example, such bases have 
been found (Meyer [1985], Lemarie and Meyer [1986]) whose Fourier 
transforms, like those above, are C°° with compact support and which 
are, simultaneously, unconditional bases for all the spaces L P (IR) with 
1 < p < oo as well as all the Sobolev spaces and some other popu- 
lar spaces to boot. Similar bases were constructed in connection with 
quantum field theory (Battle [1987]) which are only C k for finite k but, 
in return, are better localized in the time domain (they have exponen- 
tial decay). The concept of multiscale analysis (Mallat [1987], Meyer 
[1986]) provided a general method for the construction and study of 
orthonormal bases of wavelets. This was then used by Daubechies 
[1988b] to construct orthonormal bases of wavelets having compact 
support and arbitrarily high regularity. 

The mere existence of such bases has surprised analysts and made 
wavelets a hot new topic in current mathematical research. They 
are also finding important applications in a variety of areas such as 
signal analysis, computer science and quantum field theory. They are 
the subject of the next chapter, where a new, algebraic, method is 
developed for their study. 
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Chapter 3 
FRAMES AND LIE GROUPS 



3.1. Introduction 

Although we have not sought to exploit it until now, it is clear that 
all our frames so far have been obtained with the aid of group op- 
erations. The frames associated with the canonical coherent states 
and the windowed Fourier transform were built using translations in 
phase space (Weyl-Heisenberg group), while the wavelet frames used 
translations and dilations (the affine group). In this chapter we look 
for a unifying pattern in these constructions based on group theory. 
We analyze the foregoing constructions in turn, and draw separate 
lessons from each. It will be natural to work in reverse order. The 
affine group, which is, in some sense, the simplest, will lead us to 
the general method. Successive refinements will be suggested by the 
windowed Fourier transform and the canonical coherent states. 



3.2. Klauder's Group— Frames 

This was the first of the group-theoretic constructions, pioneered by 
J. R. Klauder [1960, 1963a, 1963b], who was also the first to apply it 
to the affine group G = JR* x IR (Aslaksen and Klauder [1968, 1969]). 
An element g = (a, s) of G acts on the real line ("time") by dilation 
followed by translation: 



The group-composition law is given by 

g'gt = (a', s')(a, s)t = a' (at + s) + s' = (a'a, as + s')t, (2) 

hence g~ l = (a -1 ,— s/a). (The form of the composition law shows 
that the subgroup of dilations acts on the subgroup of translations, 



gt = (a, s)t = at + s. 



(1) 
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so that G is the semidirect product of the two.) The frame vectors 
for the wavelet transform were obtained from a single "basic wavelet" 
vector h by applying the transformation 



It it easy to see that with respect to the inner product in L 2 (IR), 

(a) (U(a, s)*h) (t) = \ a\ 1 ' 2 h(at + s) = (U(a, s^h) (t), and 

(b) U(a', s')U(a, s)h = U(a'a, a's + s')h. 

In terms of the group operations, this means that U(g)* = U{g)~ x 
and U(g'g) = U(g')U(g), respectively. That is, each U(g) is a uni- 
tary operator (on L 2 (IR) ), and g i— > U(g) is a representation of G. 
This means that U is a unitary representation of G on L 2 (IR). The 
theory of such representations for general groups is a deep and highly 
developed subject, and is of fundamental importance in quantum me- 
chanics, as was realized by Hermann Weyl and others long ago (Weyl 
[1931]). In the broadest sense, group representations amount to a vast 
generalization of the exponential function (think of the map z i— > e az 
from (E to (E*), and unitary group representations generalize the map 
x i— > e iax from IR to the unit circle. 

A representation U of a group G on a Hilbert space H is said to be 
reducible if TL has a non-trivial closed subspace S (i.e., S ^ {0} and 
S 7^ H) which is invariant under U (i.e., U(g)S C S for every g EG). 
If U is unitary and S is invariant, then clearly so is its orthogonal 
complement S . If no such S exists, then U is said to be irreducible. 
It can be shown that the above representation of IR* x IR on L 2 (IR) 
is, in fact, irreducible. 

The method to be described below assumes that we begin with 
a given irreducible unitary representation U of a given group G on a 
given Hilbert space Ti. In addition, we assume that the group is a 
Lie group, meaning that it has a differetiable structure such that it 
is essentially determined (in local terms) by its Lie algebra of left— 
(or right-) invariant vector fields. (See Helgason [1978], Varadarajan 
[1974] or Warner [1971] for background on Lie groups.) The affine 
group and the Weyl-Heisenberg group are examples of Lie groups, as 
is IR n (under vector addition as the group operation). 

Given a general setup (U, G, H ) as above, choose an arbitrary 
non-zero vector h in H. (Klauder dubbed h a "fiducial vector"; for 
the affine group, this was the "basic wavelet" .) For every g e G define 
the vector 
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h g = U(g)h. (4) 

Since U is unitary, \\h g \\ = \\h\\. The /i 3 's are covariant under the 
action of G on H, i.e., 



U(g')h g = U{g')U{g)h = U(g'g)h = h g , g . (5) 

Hence the set of all finite linear combinations (span) of h g 's is invari- 
ant under U, and therefore so is its closure S. Since U is irreducible 
and S 7^ {0}, it follows that S = 7i. This means that every vec- 
tor in 7i can be approximated to arbitrary precision by finite linear 
combinations of /i g 's — a good beginning, if one is ultimately inter- 
ested in reconstruction! To build a frame from the /i 3 's, we need a 
measure on G. Now every Lie group has an essentially unique (up 
to a constant factor) left-invariant measure, which we will denote by 
dfi. This means that if E is an arbitrary (Borel) subset of G, and if 
g\E = {gig | g E E} is its left translate by gi G G, then 

H( 91 E) = / dfi(g) = n(E) ee f dfi(g). (6) 

In local terms, dfi(gig) = dfi(g) for fixed g\. [Similarly, there exists a 
right-invariant measure dfiR on G, which is in general different from 
d/i; if du,R is proportional by a constant to cfy/, the group G is called 
unimodular. Everything we do below can be repeated, with obvious 
modifications, using dfiR, provided that h 9 is redefined as U(g~ 1 )h.] 
To find d\x for the affine group, for example, we write it in the 
form of a density, 

dfi(a, s) = p(a, s) da A ds, (7) 

where da A (is is a differential 2-form denoting the area element in 
1R 2 D G. (da A ds becomes a positive measure upon choosing an 
orientation in 1R 2 and orienting all subsets accordingly; see Warner 
[1971].) Then, for fixed (ai, si) e H* x H, 



dfi((ai, si)(a, s)) = p(a\a, ais + s\) d(aia) A d(ais + s±) 
= a\p(aia, ais + si) da A ds, 



hence left-invariance implies 
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a 1 p(a\a, a\s + s\) = p(a, s). (9) 
Setting ai = 1/a and si = —s/a then shows that 

dfj,(a, s) = da A ds/a 2 , (10) 

where we have chosen the normalization p(l, 0) = 1. This is precisely 
the measure we obtained earlier using a more pedestrian approach. 
Returning to the general case, consider the formal integral 

J= I dn{g)\h g )(h g \. (11) 
Jg 

We want to show that (a) J converges, in some sense, and (b) J is a 
multiple of the identity, thus giving a tight frame in the generalized 
sense defined in chapter 1. Before worrying about convergence, let us 
formally apply U(gi) from the left: 

U( 9l )J= [ dfi(g)U( gi )\h g )(h g \ 
Jg 

dp{g)\h gig )(h g \ 
= I d»(g')\h g ,)(h g -i gl \ W 

J G 

= [ d^g')\h gl ){h gl \U{ gi ) 
Jg 

= JU( 9l ), 

where we have used the left-invariance of the measure and 



( Vv 1 = ( 1 Vv >)* = Pbi 1 ) I V »* = < V I u &)> ( 13 ) 

which follows from unitarity. 

That is, J commutes with every representative U(g) of the group. 
By Schur's Lemma (Varadarajan [1974]), it follows from the irre- 
ducibility of U that J is a multiple of the identity operator on H. 
Since J, if it converges, is a positive operator, we arrive at the desired 
frame condition 
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[ dfi(g)\h g )(h g \ =cl, (14) 

JG 

where c is a positive constant which can taken as unity by the appro- 
priate normalization of d\i. Returning to the formal integral defining 
J, the above argument shows that if the integral converges in some 
sense, it must converge to c/, hence define a bounded operator. (Of 
course c could be infinite!) Thus a necessary condition for convergence 
in the weak sense (i.e. , cts ct quadratic form) is that 



(h\Jh}= / d/i(g) (h\h g )(hg\h\ 

JG 

= [ dfi(g) \ (h\U(g)\h) 

JG 



2 < oo. 



(15) 



We have already encountered this condition in the special case of the 
affine group, where it was called the admissibility condition for h. The 
same terminology is used in the present, general setting. The above 
condition depends both on the representation U and the choice of h. 
If it is satisfied for at least one non-zero vector h, the representation 
U is called square-integrable and h is called admissible. It turns out 
that the existence of an admissible (non-zero) vector is also sufficient 
for the weak convergence of the integral J. The following theorem 
is due to Carey [1976], and Dufflo and Moore [1976]; G can be any 
locally compact topological group, in particular any Lie group. 

Theorem 3.1. Let U be a square-integrable unitary irreducible 
representation of G on a Hilbert space Ti. Then there exists a unique 
self-adjoint (in general unbounded) operator C onli. such that: 

(a) The domain of C coincides with the set of all admissible vectors. 

(b) If hi and h 2 are admissible, then for all fi and f 2 in H we have 



[ d f i(g)(f 1 \U(g)h 1 )(U(g)h 2 \h) = (Ch 2 \Ch 1 )(f 1 \h). (16) 

JG 

(c) If G is unimodular, then C is a multiple of the identity. 

Choosing hi = h 2 = h now leads to the earlier resolution of 
unity, with c = ||C/i|| 2 . As usual, an arbitrary vector / in H can 
be "presented" as a function f(g) = (h g \ f) on G, from which / 
may be reconstructed as a linear combination of h g J s weighted by the 
reproducing kernel K(g' , g) = {h g i \ h g ) . 
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Note that the basic wavelet h corresponds to 

h(g) = (h g \h) = (h\U(g)h), (17) 

so the admissibility condition is nothing but the requirement that h 
belong to L 2 {d[i). 

A potentially interesting generalization of this scheme is actually 
possible. By the foregoing theorem we can use two distinct admissible 
vectors: the vector hi is used to analyze f, i.e. present it as f(g) = 
((h>2)g\f), while the vector h\ is then used to synthesize f from 
f(g). There is no fundamental reason to use the same "wavelets" for 
analysis and synthesis, provided only that they overlap in the sense 
that 

(Ch 2 \Ch 1 )^0. (18) 

Absorbing the reciprocal of this constant into the group measure, the 
map T: / i— > / is an isometry from 7i onto its range 3?t- 

Due to the "covariance" of the /i g 's with respect to the action of 
G, the representation of G on acquires the simple geometric form 

(u(g 1 )f)(g) = (h g \U(g 1 )f) 

=<vj/> (19) 

= /art), 

showing that G acts on $t T by merely translating the variable in the 
base space G. The realization of / by / and U by U as above is called 
the coherent-state representation determined by the pair (U,h). We 
will also refer to the frame { h g \ g G G} as the group-frame (G- 
frame) associated with (U,h). From a purely mathematical point of 
view, one of the attractions of this scheme is that although we started 
with an arbitrary representation of G on an arbitrary Hilbert space, 
this construction "brings it home" to G itself and objects directly as- 
sociated with it: the Hilbert space is a closed subspace of L 2 (d(x), and 
the representation is induced from the (left) action of G on itself, i.e. 
g i— > g± 1 g. That is, Klauder's construction exhibits U as a subrepre- 
sentation of the regular representation of G. (See Mackey [1968] for 
the definition and discussion of the regular representation.) 
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3.3. Perelomov's Homogeneous G— Frames 

Let us now attempt to apply the group-theoretic method to the win- 
dowed Fourier transform. As most of our applications will be to the 
phase-space formulation of quantum mechanics, we shift gears and 
replace the time by a space coordinate, t — > — x (the sign is related to 
the Minkowski metric; see section 1.1) and the frequency by a momen- 
tum coordinate, 2tvu — > p. Although it is a trivial matter to extend 
everything we do in this section to an arbitrary (finite) number of de- 
grees of freedom (where x and p belong to IR S ), we restrict ourselves 
to a single degree of freedom to keep the notation simple. The self- 
adjoint generators of translations in space and momentum, P and X, 
are defined by 

Pf(x) = -i^f(*)> ( x fT(p) = i^/(p). (1) 
Expressed in the space domain, 

Xf(x) = xf(x). (2) 

Starting with a basic window function h(x), the window centered at 
(p, x) in phase space is given by 



ipx („-ixP h ^ ( x tj ( 3 ) 



h p , x (x) = e ipx 'h(x' -x 

= e ipx ' (e 

= (e ipX e~ lxP h) (x'). 

That is, h PjX is obtained from h by a translation in space (by x) 
followed by a translation in momentum (by p). Defining the corre- 
sponding unitary operators 

U(p,x) = e ipX e~ lxP , (4) 

let us see what happens when two such operations are applied in 
succession: 

(U(p 1 ,x 1 )U(p,x)h) (x) = (U(p u xi)h PjX ) (x 1 ) 

6 Pl hp^ x (x X\) 
= e ipiX 'e ip ( x '- Xl) h(x' - Xl -x) (5) 

° °Pi -rP,%l V / 

= e~ ipXl U{pi +p,xi +x)h(x'). 
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Hence the operators U(p, x) do not form a group, since two successive 
operations give rise to a "multiplier" exp(— ipx\). The reason is that 
translations in space do not commute with translations in momentum, 
as can also be seen at the infinitesimal level by noting that their 
respective generators obey the "canonical commutation relations" 

= il. (6) 

The remedy (suggested by Hermann Weyl) is to include the identity 
operator as a new generator (it generates phase factors e -1 ^ which 
can be used to absorb the multiplier) . To see this in terms of unitary 
representations of Lie groups, consider the abstract real Lie algebra 
w with three generators {—iS, —iT, —iE} and Lie brackets 

[S,T]=iE, [S,E}=0, [T,E}=0. (7) 

The corresponding three-dimensional (simply connected) Lie group 
W is known as the Weyl-Heisenberg group. Topologically, W is just 
IR 3 . (If configuration space is IR S , the corresponding Weyl-Heisenberg 
group W s « IR 2s+1 .) A general element in W, parametrized by 
(p, x, (p) G IR 3 , may be expressed as the product of three factors 

g(p, x, <f>) = exp(—i(/)E) exp(ipS) exp(— ixT), (8) 

where "exp" denotes the exponential mapping from the Lie algebra 
w to the Lie group W, whose group law is 

g(p 1 ,x 1 ,(f) 1 )g(p,x,(f)) = g{p x +p,x x + x, 0i +(f) + px 1 ). (9) 

A unitary irreducible representation of W on L 2 (IR) is obtained by 
the correspondence S — > X, T — > P, E — > /. This is known as 
the Schrddinger representation. The unitary operator correponding 

to g(x,p,(f>) is 

U(p, x, 0) = e~ l(t> e ipX e~ lxP . (10) 

As expected, these operators are closed under multiplication, with the 
composition law 



x, 



.d_ 
dx 



U(pi,xi,4>i)U(p,x,(l)) = U(pi +p,X! +X,(j) 1 + 4> + pxi). 
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With this unitary irreducible representation of W on L 2 (IR), we 
have all the ingredients needed to attempt the construction of a group 
frame for W. The prospective frame vectors are 

h P , x ,4> = U(p,x, 4>)h = e~ l4> h p , x , (11) 

where h P}X are the vectors defined earlier. The left-invariant measure 
on W (which, in this case, is also right-invariant) is just Legesgue 
measure on IR 3 , given by the differential form dp Adx A d(p. This can 
be seen by looking at the composition law: For fixed (p±,xi, 0i), 



d(pi + p) A d(xi + x) A d((pi + (fi + pxi) = dp A dx A dtp (12) 

since dp A dp = 0. The method of section 2 then gives the following 
candidate for a resolution of unity: 

J = dp A dx A d(f> ] hp^rf ) ( h PjX ^ | 

• /vv (13) 
dp A dx A d(j) | h p , x ) ( h PjX \ , 

w 

where the phase factor cancels in the integrand. This integral clearly 
diverges, since the integrand is independent of <fi and the integration 
is over all real (f>. Equivalently, the representation U is not square- 
integrable, since for every nonzero /i, 

c^J dpdx ^\(k\U( P , X , m \^co (14) 

due, again, to the constancy of the integrand in (/). We could get 
around the problem by choosing a multiply connected version Wi of 
W, say with < <p < 2n. (Wi has the same Lie algebra as W). 
This would indeed give a tight frame, but this frame is unnecessarily 
redundant (as opposed to the beneficial sort of redundance associ- 
ated with oversampling) since the vectors h PiX ^ are not essentially 
different for distinct values of (p. More significantly, we would miss an 
important lesson which this example promises to teach us. For other 
important groups, as we will see, the problem cannot be circumvented 
by compactifying the troublesome parameters. The following solution 
was proposed by Perelomov [1972] (see also Klauder [1963b, p. 1068], 
where this idea is anticipated). To get rid of the ^dependence, choose 
a "slice" of W, 4> = a(p,x), and define 
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h a = h I ^ = e ~MP,x)h (ik) 
lb p,x lb p,x,a(p,x) c "75, x- \ ±0 ) 

Integrating only over this slice with the measure dp A dx, we get 



J' = J d P Adx\ h£ x ){h% tX I = c h I, 



(16) 



where Ch = 27r||/i|| 2 , since the integral reduces to the same one we had 
for the windowed Fourier transform. 

From a computational point of view this is, of course, trivial. But 
to extend the technique to other groups we must understand the group 
theory behind it. Suppose, then, that we return to the general setup 
we had in the last section: Given a unitary irreducible representation 
U of a Lie group G on a Hilbert space 7i, choose a nonzero vector h 
in H and form its translates h g = U(g)h under the group action as 
before. Consider now the set H of of all elements k of G for which 
the action of U(k) on h reduces to a multiplication by a phase factor 
X(k) = exp[z0(/c)]: 

U(k)h = X (k)h, keH. (17) 

In the case of W, H consists of all elements of the form k = (0, 0, 4>) 
and U(k) is, in fact, nothing but a phase factor. However, this is 
deceptive, since in general U(k) may be a non-trivial operator, acting 
trivially only on some vectors h. Hence, in general, H will in fact 
depend on the choice of h and should properly be designated H(h). 
Now for two elements k\ and of H, we have 

U(hk^)h = U^Uih)-^ = X (h)x(k2)- 1 h, (18) 

hence kik^ 1 also belongs to H and it follows that if is a subgroup of 
G. Furthermore, the above equation shows that the map 1 — > is 
a group-homomorphism of H into the unit circle, thus it is a charac- 
ter of H, i.e. a unitary representation on the one-dimensional Hilbert 
space (E. H is called the stability subgroup for /i, and its Lie algebra 
is called the stability subalgebra. The reason for this terminology is 
that H does not affect the quantum-mechanical state defined by h, 
since all observable expectation values in that state are given in terms 
of sequilinear forms in h. More precisely, if we choose \\h\\ = 1, the 
corresponding state is by definition the rank-one projection operator 

P= \h)(h\ (19) 
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and the expected value of any observable A in this state is given by 

(A) = (h\Ah) = trace (PA) . (20) 

(This formulation, besides avoiding irrelevant phase facors, also per- 
mits states which are statistical mixtures of pure states as needed, 
for example, in statistical quantum mechanics.) The translate of P 
under a general group element g is then 

P 9 = \ h 9 )(h g \ = U{g)PU{g)\ (21) 

hence for g in H we have P g = P, i.e., P is stable under H as the 
name implies. There is, therefore, some advantage to formulating 
the theory as much as possible in terms of states rather than Hilbert 
space vectors since this automatically eliminates the fictitious degree 
of freedom represented by the overall phase. [Incidentally, a similar 
situation appears to prevail in communication theory, since an overall 
phase shift has no effect whatsoever on the informational content of 
the signal.] However, the Hilbert space vectors will play an important 
role in connection with holomorphy (recall that in the coherent-state 

representation, f(z) is analytic, whereas | f(z) | 2 is not), hence we 
work primarily with them. 
If g G G and k G H, then 

h gk = U(gk)h = U(g)U(k)h = X (k)h g , (22) 

hence P g k = P g . That is, P g is the same for all members of the left 
coset 

gH = {gk\ke H}. (23) 

The set of all translates P g is therefore parametrized by the left coset 
space 

M = G/H = {gH | g G G}. (24) 

Members of M will be alternatively denoted by m and by gH, and 
will play a dual role: as points in M, and subsets of G. In the case 
of W, for example, M is parametrized by (p, x) G IR 2 , i.e. it is phase 
space, precisely the label space for the frame we obtained. The coset 
(p, x)H is a straight line in W ~ IR 3 . (We will see in the next chapter 
that W can be interpreted as a degenerate non-relativistic limit of 
phase space x time and (p,x)H then corresponds to the trajectory of 
a free classical particle in W.) 
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To build a frame, we take a "slice" of G by choosing a represen- 
tative from each coset, i.e. choosing a map 

a: G/H -> G. (25) 

Note: This is actually a non-trivial process. The projection G — > 
G/if defines a iiber bundle, and a is a section of this bundle; in 
general a can be chosen smoothly only in a neighborhood of each 
point of G/H, i.e., locally (see F. Warner [1971], p. 120). However, 
this is sufficient for our purposes, since we will ultimately deal only 
with the state P m , which is independent of the choice of a. # 

Thus a has the form a(gH) = g a(g) for some function a: G — > H. 
In the case of W, we had cr(p, x) = (p, x, a(p, x)). The frame vectors 
corresponding to this choice are 

Kn = K{m) = U(a(m))h. (26) 

To build a frame, we need two more ingredients: an action of G on the 
label space M, and a measure on M which is invariant with respect 
to this action. The action is easy, since G acts naturally on G/H by 
left translation: 

9l (gH) = (g l9 )H. (27) 

If m = gH G M, we will denote {g\g)H by g\m. M is called a 
homogeneous space of G. As for an invariant measure, it exists, 
in general, only subject to a certain technical condition (Helgasson 
[1962], p. 369). It does exist whenever G is a unimodular group (its 
right- and left-invariant measures are proportional by a constant), 
such as W. Let us assume that a G-invariant measure does exist on 
M (in which case it is unique, up to a constant factor) and denote it 
by d\x M . Once more we consider the formal integral 

J=f dfi M (m)\h^){h^\ = I dfi M (m)P m . (28) 

If we can show that J commutes with every U(gi), then irreducibility 
again forces J = cl. But 
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U(g 1 )h° gH = U(g 1 )U(a(gH))h 
= U( 9l a(gH))h 

= U( gi ga(g))h (29) 
= U(g ig a(g 1 g)a(g 1 g)- 1 a(g)) h 
= xi^igig^aig^h^gH. 

That is, the vectors are "almost" covariant under the action of G, 
with a residual phase factor (multiplier). This means that the states 
P m transform covariantly, i.e., 

U(gi)PmU( gi y = P gim . (30) 

Thus 

U(g 1 )JU(g 1 )- 1 = [ d^ M (m)U( gi )P m U( gi y 

= / dn M (m)P gim 

Jm (31) 

= / dii M (m')P m/ 
Jm 

= J, 

using the unitarity of U(g\) and the invariance of d\x M . This proves 
that J = c/, provided the integral converges in some sense. 
Note: The above proof becomes shorter if one works directly with 
the states P m rather than the frame vectors; however, it is important 
to see the action of G on the frame vectors in terms of the multipli- 
ers since this will play a role in the next section, where we look for 
representations of G on spaces of holomorphic functions. # 

A necessary condition for the weaic convergence of the integral is 
that 

(h\Jh)= [ dfi M (m) |(/CIM! 2 <°o, (32) 

i.e., that h(m) = (h 1 ^ \ h) be in L 2 (dfx M ). As before, this condition 
is also sufficient, and the same terminology is used: h is called ad- 
missible, and U is called square-integrable. The action of U in the 
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Hilbert space of functions /(to) = (h^lf) is somewhat more com- 
plicated than earlier because of the multiplier. (If we simply dropped 
the multipliers we would no longer have a unitary representation of 
G but a projective representation; see Varadarajan [1970].) 

The above construction has the advantage that the trivial part of 
the action of G is factored out, thereby improving the chances that the 
integral J converges. As mentioned above, it depends on the existence 
of the invariant measure dfi M on G/H, which is not guaranteed. Note 
that for fixed g G G, the stability subgroup for the vector U(g)h is 
gHg~ l , i.e., a subgroup of G conjugate to H. In general, however, 
the stability subgroups of two admissible vectors h\ and may not 
be conjugates. It may happen that GjH\ has an invariant measure 
while G/H2 does not. It pays, therefore, to choose the vector h very 
carefully. Intuition suggests that h should be chosen so as to maximize 
the stability subgroup, since this will minimize the homogeneous space 
M and improve the chances for convergence. Furthermore, maximal 
use of symmetry would seem to make it more likely that an invariant 
measure exists on the quotient. More will be said about this in the 
next section, in connection with the "weight" of a representation. 

We will refer to the frame { h a m } as a homogeneous G-frame as- 
sociated with (U, h). The dependence on a will usually be suppressed, 
since a change in (the local sections) a gives an equivalent frame. 



3.4. Onofri's Holomorphic G— Frames 

The frames associated with the windowed Fourier transform and the 
canonical coherent states are similar in that both provide "phase 
space" representations of functions in L 2 (1R S ). The distinguishing fea- 
ture is that the representation determined by the canonical coherent 
states is in terms of holomorphic (analytic) functions. Neither of the 
two general group-theoretic methods covered so far explains the origin 
of this analyticity, yet it has been found that all such group-related 
phase-space representations have analytic counterparts. Moreover, 
analyticity will play a key role in the coherent-state representations 
of relativistic quantum mechanics (next chapter) and quantum field 
theory (chapter 5). The method to be described in this section as- 
sumes we are dealing with a compact, semisimple group. Since the 
coherent-state representations we will develop for relativistic quan- 
tum mechanics are based on the Poincare group, which is neither 
compact nor semisimple, the present considerations do not apply di- 
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rectly to the main body of this book (chapters 4 and 5). Neverthe- 
less, we describe them in considerable detail in the hope that they 
may shed some light on our later constructions, which are still not 
well-understood in general terms. 

Let us return to the canonical coherent states in order to isolate 
the property leading to analyticicy and find its generalization to other 
groups. We follow an approach first advocated by Onofri [1975]. For 
related developments, also see Perelomov [1986]. Again consider the 
three-dimensional Weyl-Heisenberg group W, whose (real) Lie alge- 
bra w has a basis {—iS, —iT, —iE} which is represented on L 2 (IR) by 
S — > X, T — > P and E —> I. Recall that we arrived at the canoni- 
cal coherent states \z as eigenvectors of the non-hermitian operator 
A = X + iP, which represents the complex combination S + iT of 
generators in w. Since w is a real Lie algebra, we must complexify 
it in order to consider such combinations of its generators. This is 
done in the same way as complexifying a real vector space, namely by 
taking the tensor product with the field of complex numbers. With 
obvious notation, w c = w(g>(E = w + iw. As will be seen below, 
complex combinations such as S ± iT play a very important role in 
the theory of reai Lie groups, exactly for the same reason that com- 
plex eigenvectors and eigenvalues are necessary in order to study real 
matrices. 

We begin by rederiving the canonical coherent states from an 
algebraic point of view which shows their relation to the vectors h PjX 
associated with the windowed Fourier transform and pinpoints the 
property which makes them analytic. In the last section we found 
that 

h p , x = e^ x e~™ p h. (1) 

Now if B and C are operators such that [B, C] commutes with both 
B and C, then the Baker-Campbell-Hausdorff formula (Varadarajan 
[1974]) reduces to 

e B e c =e i[ B) C] e B+C (2) 
Since [ipX, —ixP] = ipxl, we therefore have 

hp, x = exp(ipx/2) exp (ipX — ixP) h. (3) 

Substituting 



X = (A*+A)/2 and - iP = (A* - A)/2 (4) 
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and denning z = x — ip, we get 

h p>x = exp(ipx/2) exp [zA* /2 - /i. (5) 

So far, all our manipulations have been justifiable since we have only 
exponentiated skew-adoint operators. The next one is more delicate 
since the operators to be exponentiated are not skew-adjoint; it will 
be justified later. Using the Baker-Campbell-Hausdorff formula again 
with B = zA*/2 and C = -zA/2, write 



h P;X = exp (ipx/2 — zz/A) exp (zA* /2) exp (—zA/2) h. (6) 
If we choose 

h(x') = Nexp(-x' 2 /2) = X o(x') (7) 

(where N = (27r) _1//4 and xo is just the canonical coherent state Xz 
with z = 0), then Ah = 0, hence exp (— zA/2) h = h and 

h PtX = exp (ipx/2 - zz/A) exp (zA* /2) xo- (8) 
We claim that 

exp (^72) xo = Xz- (9) 

This can be seen by applying A to the left-hand side, then using 
Axo = and [A, A*] = 21: 

Aexp(M*/2) xo = [Aexp(^4*/2)] X o 
= z exp(zA*/2) xo, 

which shows that exp(^A*/2) xo equals Xz up to a constant factor, 
which is easily shown to be unity. That is, 

h P:X = exp (ipx - zz/A) Xz, (H) 

so the canonical coherent states are a special case (modulo the z- 
dependent factor in front) of the frame vectors h PjX for the choice 
h = xo- Note that the corresponding states are related by 

\h PtX )(hp, x \ = exp (-\z\ 2 /2) \ xz)(Xz\, (12) 
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giving the weight function on the right-hand side in the bargain. The 
reason for this is that the /i P)X 's were obtained by unitarily translating 
h, whereas the operator exp (zA*/2) which "translates" %o to Xz is not 
unitary but results in ||x z || = exp (|z| 2 /4) ||xo||j the weight function 
then corrects for this. 

We can now justify the fine point we glossed over earlier. The 
expression 

exp (zA*/2) exp {-zA/2) Xo (13) 

makes sense because 
(a) 

e~ zA/2 Xo = Xo (14) 

since Axo = 0, and 
(b) xo is an analytic vector (Nelson [1959]) for the operator A*, since 

00 i 00 \~\n 

E^IK^f X„ll=£^P*"xoll 

"° ' - . ,. " =0 ' < 15 > 

= E^t 2 " /2 ^IIxoII<~ v*e<E, 



2 n nl 

n=0 



where ||A* n xo|| = 2 n / 2 v / n! follows from [A, A*] = 21 and A X o = 0. 

As will be seen below, the key to constructing representations of 
real Lie groups on spaces of analytic functions will be to 

(a) complexify the Lie algebra; 

(b) find a counterpart to Axo = 0; 

(c) define counterparts to Xz = exp(^A*/2) %o- 

Before embarking on this task, we must make a brief excursion 
into the structure theory of Lie algebras. For background and details, 
see Helgason [1978] or Hermann [1966]. The Lie groups and -algebras 
we consider below are assumed to be real and semisimple. (A Lie 
algebra is semisimple if it contains no proper abelian ideals; a Lie 
group is semisimple if its Lie algebra is semisimple.) Since W is not 
semisimple (the subspace spanned by — %E is an ideal of w), these 
results do not apply to it, strictly speaking. However, as will be 
shown in section 3.6, W can be obtained as a (contraction) limit of a 
simple Lie group (SU(2)), and this turns out to be sufficient for our 
purpose. Thus, let G be an arbitrary real, semisimple Lie group and 
g its Lie algebra. The following is known: 
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1. Every element X of g defines a linear map adX on g by 
a,dX(Y) = [X, Y]. Similarly, every Z G g c defines a complex- 
linear map (denoted by a,dZ) on g c . It therefore makes sense to 
look for eigenvalues and eigenvectors of ad Z. 

2. g has a (Lie) subalgebra h called a Cartan subalgebra, defined 
as a maximal abelian subalgebra of g satisfying an additional 
technical condition, h is analogous to a "maximal commuting 
set of observables" in quantum mechanics. Its complexification, 
denoted by h c , is a maximal abelian subalgebra of g c . (The tech- 
nical condition on h essentially ensures that each of the linear 
maps ad-ff, with H G h c , has a complete set of eigenvectors in 
g c , hence that all the linear transformations adH with H G h c 
can be diagonalized simultaneously.) 

3. For every complex-linear form a: h c — > (C , let 



That is, g a is the set of all vectors in g c which are common 
eigenvectors of a,dH for every H in h c , with corresponding eigen- 
value a(H). For generic ct, no such eigenvectors will exist, hence 
g a = {0}. When g a 7^ {0}, a is called a root (of g c with respect 
to h c ), each Z in g c is called a root vector and g a is called a root 
subspace of g c . Clearly, the zero-form o(i7) = is a root, since 
the fact that h c is abelian means that g° contains h c . The fact 
that h c is maximal-abelian means that actually g° = h c , since 
every Z in g° commutes with all H in h c . 
4. Let Z a G g a and Zp G gP ', where a and are arbitrary linear 
forms on h c . Then the Jacobi identity implies that for every H 
in h c , 



5. The set of all nonzero roots is denoted by A. g c has a direct-sum 
decomposition (root space decomposition) as 



g a = {Ze gc \ [H,Z] 



a(H)Z V#Gh c }. 



(16) 




(17) 



(18) 



g c = h c + J2& a > 



(19) 
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and each g a (with a G A) is a one-dimensional subspace of g c . 

6. The Killing form of g c is the bilinear symmetric form defined by 

B(Z, Z') = trace (ad Z ad Z') . (20) 

It is non-degenerate on g c , and its restriction to h c is also non- 
degenerate. Since a non-degenerate bilinear form on a vector 
space defines an isomorphism between that space and its dual, 
the restriction of B to h c defines an isomorphism between h c and 
h*. The vector in h c corresponding to a G h* is denoted by H a . 
It is defined by 

B(H a ,H) = a(H) VHeh c . (21) 

Note that the vector corresponding to a = is Hq = 0. 

7. If a G A, then —a G A and a(H a ) = B(H a , H a ) ^ 0. Further- 
more, 

[g a ,g- a ] = QH a , (22) 

i.e. the set of all brackets [Z a ,Z_ a ] with Z a G g a fills out the 
one-dimensional subspace spanned by H a . (H a ^ since a G A.) 

8. Any two root subspaces g a and g^ with a+(3 ^ are "orthogonal" 
with respect to B. 

9. It is possible to choose (non-uniquely) a subset A + of A, called 
a set of positive roots, such that 

(a) If a and j3 belong to A + and a + (3 is a root, then a + (3 
belongs to A + . 

(b) The set A~ = — A + is disjoint from A + . 

(c) A = A+UA". 

It follows from (4) and (9) that 

n+ = S° and n " = g ° ( 23 ) 

a6A+ aeA~ 

are Lie subalgebras of g c , and by (5), 

g c = h c + n+ + n-. (24) 

Since there can only be a finite number of roots, (4) and (9) also imply 
that after taking a finite number of brackets of elements in either n + 
or n~ we obtain zero; that is, the subalgebras n ± are nilpotent. We 
may choose a basis for g c as follows: Pick a non-zero vector Z a from 
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each g a with a G A + . Then by (7) there is a unique vector Z- a in 
g~ a with 

] = 2H a /a(H a ). (25) 
Since Z a and Z- a are root vectors, they also satisfy 



[H a ,Z a ] = a(H a )Z a 
[H a ,Z- a ] = —a(H a )Z- a , 



(26) 



and a(H a ) ^ (property (7)). If we define 

C a = H a /a(H a ), (27) 

then each triplet (Z a , C a , Z- a ) spans a Lie subalgebra of g c which is 
isomorphic to sl(2, (C ): 



[C a , Z± a ] — ±Z± a , [Z a , Z_ a ] — 2C a . (28) 

The root space decomposition then shows that g c is a direct sum of 
copies of sl(2, (D), indexed by A + . 

The connection with the non-hermitian combinations X ±iP can 
now be explained. The following argument is heuristic and has no 
pretense to rigor. Its sole purpose is to motivate the construction 
of general holomorphic frames. It will be made precise in section 
3.6 at the level of unitary representations, where a clear geometric 
interpretation will be given. 

Begin with the group G = SU(2), which is a simple, real Lie group 
whose Lie algebra g = su(2) has a basis {— iJ\, —1J2, —1J3} satisfying 

[Ji, J2] = 1J3 

[J2,J 3 ]=iJi (29) 
[J3, Ji] = ih- 

We first show that w can be obtained as a "contraction limit" of g.* 
Choose a positive number k and define K\ = kJ±, K2 = KJ2 and 
K 3 = k 2 J 3 . These form a new basis for g, with 



* The idea of group contractions is due to Inonii and Wigner [1953]. 
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[K u K 2 ] = iK 3 

[K 2 ,K 3 ] = in 2 K 1 (30) 
[K 3 ,K 1 ] = ik 2 K 2 . 

In the limit k — > 0, g becomes isomorphic to w. We will see in 
section 3.6 that within a unitary irreducible representation of 577(2), 
the operator K 3 can be chosen so that K 3 — > I as k — > 0, hence we 
may interpret the limits of i^i and 7^ as X and P, respectively. 

Now apply the above structure theory to g c , which is just 
sl(2, (D). The direct sum of copies of sl(2, (E) therefore reduces to a 
single term. For the Cartan subalgebra of g we can choose h = IR J 3 
(the one-dimensional subspace spanned by J3), so that h c = (DJ3. 
Two linearly independent root vectors are given by J± = J\ ± iJ 2 , 
with 



[J 3 ,J+] = J+, [J 3 ,J_] = -J_. (31) 

We choose ct( J3) = 1 as the single "positive" root and J± for the basis 
vectors of the two one-dimensional root subspaces. Then 



[J+,J_]=2J 3 (32) 

shows that C Q corresponds to J3. Setting K± = kJ± and K 3 = k, 2 J 3 , 
and taking the limit k — > 0, we obtain the correspondence 



K+ ^ S + iT ^ X + iP = A 

K_ ^ S - iT ^ X - iP = A* (33) 

where we have used "1— >" to denote the representation of w on L 2 (IR) . 
[This correspondence is not unique; for example, K + — > A*, if_ — > A, 
i^3 — > — £7 is equally good. As will be shown in section 3.6, both of 
these correspondences actually occur as weak limits, due to the fact 
that an irreducible representation of 577(2) contracts to a reducible 
representation of w.] Note that the three roots {k 2 , 0, — n 2 } all merge 
into a single root, zero, in the contraction limit. Thus the operators 
A and A* are interpreted as the contraction limits of root vectors. 
Note: Another way of seeing the importance and naturality of A 
and A* is in the context of the Kirillov-Kostant-Souriau theory, 
sometimes called "Geometric Quantization," applied to the Oscillator 
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group (Streater [1967]; see section 3.6). For background on Geometric 
Quantization, see Kirillov [1976], Kostant [1970] and Souriau [1970]; 
see also Guillemin and Sternberg [1984], Simms and Woodhouse [1976] 
and Sniatycki [1980]. 

We are, at last, ready to generalize the construction of group- 
representations on spaces of holomorhic functions to other groups. 
We begin with a semisimple, real Lie group G and a unitary irre- 
ducible representation U of G on a Hilbert space Ti. To avoid tech- 
nical difficulties, we will here assume that G is compact. (The rea- 
son for assuming compactness, as well as ways to get around it, will 
be discussed below.) Then the irreducibility of U implies that Ti be 
finite-dimensional, hence the operators U(g) representing group oper- 
ations are just unitary matrices and the operators U(X) representing 
elements of g are skew-adoint matrices. (Sometimes the operator 
representing X is written as dU (X) to emphasize its "infinitesimal" 
nature; we will write it as U(X) to keep the notation simple.) At the 
Lie algebra level, U extends, by complex-linearity, to a representation 
T of g c : 

T(X + iY) =U(X) +iU(Y). (34) 

T is the unique representation of g c that extends U; that is, for all 
ci, c 2 G (C and all Z\, Z 2 € g c , 

T{c 1 Z 1 + c 2 Z 2 ) = ciT(Zi) + c 2 T(Z 2 ) 

T([Z 1 ,Z 2 ]) = [T(Z 1 ),T(Z 2 )} (35) 

T(X) = U(X) if legcg, 

The matrices T(Z) are no longer skew-adjoint, but because of the 
finite dimensionality of Ti, there is no problem in exponentiating them 
to give a representation of G c on Ti, which we also denote by T. Thus, 
the representative of the group element exp Z of G c is defined by* 

T(expZ) =exp[T(Z)], Z e g c . (36) 

Note: If G is non-compact, exp Z is still well-defined but the right- 
hand side of the above equation is problematic since for any non- 
trivial representation U, Ti is infinite-dimensional and T(Z) will, in 



* Since G is compact, it is exponential; that is, every group element 
can be written in the form exp Z for some Z. 
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general, be a non-skew-adjoint, unbounded operator. (Even the def- 
inition T(Z) = T(X) + iT(Y) becomes troublesome, since the skew- 
adjoint operators T(X) and T(Y) may both be unbounded and their 
domains may have little in common; additional assumptions must be 
made.) In the case of W, this was resolved by restricting exp[T(Z)] to 
act on analytic vectors. A similar approach is used in extending the 
present construction to non-compact G. For the present, we continue 
to assume that G is compact to avoid this problem. # 
Lemma 3.2.. 

(a) g I— > T(g) is an irreducible (non-unitary) representation of G c . 

(b) The map Z \— > T(expZ) is analytic as a map from the complex 
vector space g c to the complex matrices on H. 

Proof. If A is a matrix which commutes with all T(Z) for Z E g c , 
then in particular it commutes with all U(X) for leg, hence must 
be a multiple of the identity since U is irreducible. Therefore T is 
irreducible. To prove (b), note that Z \— > T(expZ) is, by definition, 
the composite of the two analytic maps Z \— > T(Z) and T(Z) i— > 
exp[T(Z)]. I 

It follows that the map T from G c (considered as a complex man- 
ifold; see Wells [1980]) to the group GL(H) of non-singular matrices 
on Ti is also analytic; that is, T is a holomorphic representation of 
G c , obtained by analytically continuing the representation U of G. 
Now the point of Onofri's construction is this: We have seen that by 
choosing a state which is stable under H, U can be reformulated as 
a representation of G on a space of functions /(to) defined over the 
homogeneous space G/H. In the case G = W, G/H was identified as 
a phase space, but in general it is not clear that it can be interpreted 
as such. Following Onofri, we will show that: 

(a) The representation T induces a complex structure on the homoge- 
neous space G/H, making it into a complex manifold on which G 
acts by holomorphic transformations. (Such a manifold is called a 
complex homogeneous space of G, or a holomorphic homogeneous 
G-space.) 

(b) In addition, G/H has the (symplectic) structure of a classical 
phase space, and the action of G on G/H is by canonical trans- 
formations. Thus it becomes possible to think of G/H as phase 
space. To actually identify G/H as the phase space of a classical 
physical system, i.e. as the set of dynamical trajectories followed 
by that system, it is necessary for G to include the dynamics for 
the system, i.e. its evolution group, of which nothing has been 
said so far. This will be discussed in the next chapter. 
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The representatives U(H) of the elements H of h form a com- 
muting set of skew-adjoint matrices, hence can all be diagonalized 
simultaneously. Let h be a common eigenvector: 

U(H) h = X(H) h, (37) 

where X(H) is imaginary. Since U is linear at the Lie algebra level, 
it follows that A is a linear functional on h, called the weight of h. 
(Roots are simply weights in the adjoint representation, where Ti is 
replaced by g c and U(H) by a,dH.) For any non-zero element Z a in 
g a with a in A+, we have (remembering that U(H) = T(H) since 
H e h) 

T(H)T(Z a )h=[T(H),T(Z a )} h + T(Z a )T(H)h 

= T([H,Z a ]) h + X(H)T(Z a )h (38) 
= (a(H) + X(H)) T(Z a ) h. 

That is, T(Z a ) "raises" the weight by a. Similarly, for a G A~, 
T(Z a ) lowers the weight by —a. Since non-zero vectors with different 
weights are linearly independent and 7i is finite-dimensional, it follows 
that Ti. must contain a non-zero vector with lowest weight, i.e. such 
that 

T(Z a )h = Va G A". (39) 

Equivalently, 

T(Z)h = VZen". (40) 

For the group W, h was the "ground state" xo and the above equa- 
tions correspond to T(—iE) xo = —ixo (so X(—iE) = —i) and Axo = 
0. 

Consider the subalgebra 

b = h c + n+ (41) 

of g c , called a Borel subalgebra. If N e n + and we denote by N 
its complex-conjugate with respect to the real subalgebra g, then 
iV G n~. Hence for arbitrary Z = H + N G b, we have 

T(Z)h = T(H)h + T(N)h = T(H)h, (42) 

and H belongs to h c since [H, h c ] = 0. Extending A by complex- 
linearity to h c , we therefore have 
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T(Z)h = X(H) h, (43) 

hence 

exp [T(Z)] h = exp [X(H)] h. (44) 

The subgroup B = exp(b) is called a Borel subgroup of g c . For Z as 
above, let b = exp Z E B,b = exp Z and 7r(6) = exp[A(.ff )]. Then 

T(6) /i = tt(6) /i. (45) 

Since X(H) is imaginary for if G h, complex-linearity implies that 
X(H) = -X(H) for H E h c . Hence 

tt(6) = tt(6) _1 , 6gR (46) 

The map 7r: S — > (C* satisfies 7r(6i&2) = tt(&i) tt(&2), i-e. it is 
a character of S. Furthermore, since 7r(6) is analytic in the group 
parameters of 6, Onofri calls it a holomorphic character of £?. 

Notice that the state corresponding to h (i.e., the one-dimen- 
sional subspace spanned by it) is invariant under B. If we restrict 
ourselves to the reai group G, this means that the state is invariant 
under the subgroup H. If H is the maximal subgroup of G leaving 
this state invariant, then the weight A is called non-singular. In that 
case, H plays the same role as it did in the last section: it is the 
stability subgroup of the state. We will assume this to be the case; if 
it is not (in which case A is singular), the present considerations still 
apply but in modified form. Note that in the non-singular case, the 
stability subgroup is abelian. 

We now adapt the construction of the last section in a way which 
respects the complex-analytic structure of G c . Introduce the notation 

(TO/)*)" 1 =T*(g), geG c . (47) 

Since the representation U of G is unitary, U(XJ* = —U(X) for 
X e g; hence for Z E g c , we have T(Z)* = -T(Z). It follows that 
for group elements g = exp(Z) of G c , 

T(g)# = T(g), g E G c , (48) 

where g = exp(Z) and, in particular, T(g)# = T(g) = U (g) for g E G. 
Define the vectors 
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h g = T{g)*h, geG c , (49) 

which, when restricted to g G G, coincide with the earlier frame vec- 
tors but are anti-holomorphic in the group parameters of G c . An 
arbitrary vector / gH defines a holomorphic function on G c by 

f{g) = (h g \f). (50) 

For arbitrary b E B, 



h gb = T(g)*T(b)*h = n(b)- 1 h 9 , (51) 

hence 

f{gb) = K{b)- l f{g). (52) 

Note: The reader familiar with fiber bundles (Kobayashi and No- 
mizu [1963, 1969]) will recognize the above equation as the condition 
defining a holomorphic section of the holomorphic line bundle asso- 
ciated to the principal bundle B — > G c — > G c /B by the character 
7r: B — > (D*. We now proceed to construct this section in a naive way, 
that is, without assuming any knowledge of bundle theory. # 

The above shows that the state determined by h g depends only 
on the left coset gB, which we denote by z. Let 

Z = G c /B (53) 

be the left coset space. Now 

(a) G c is a complex manifold; 

(b) B, as a complex subgroup, is a complex submanifold of G c ; 

(c) the projection map G c — > 2 is holomorphic. 

Hence it follows that Z is a complex manifold. This means that a 
neighborhood of each point z = gB e 2 can be parametrized by a 
local chart, i.e. a set of local complex coordinates (zi, ... , z n ) (say, 
with (0, . . . , 0) corresponding to z), and the transformation from one 
local chart to another on overlapping neighborhoods is a local holo- 
morphic function. In the case of G = W, B corresponds (under the 
contraction limit) to the complex subalgebra spanned by E and A, 
and Z can be identified with (E, hence only a single chart is needed 
to cover all of Z. In general, more than one chart is necessary. For 
G = SU(2), we will see that Z is the Riemann sphere S 2 : hence two 
charts are needed; however, the north pole has measure zero, and a 
single chart will do for 5' 2 \{oo} ~ (E. 
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Since G c acts on itself by holomorphic transformations, its action 
on Z is also by holomorphic transformations. This means the follow- 
ing: For <7i G G c and z = gB G Z, let w(z) = g\z = (gig)B. If 
(zi, ... , z n ) and (wi, . . . , tu n ) are local charts in neighborhoods of z 
and w, respectively, then the mapping 0: (zi, ... , z n ) \— > (wi, . . . , io n ) 
is holomorphic in a neighborhood of (0, ...,0). (It must, of course, 
be locally invertible with holomorphic inverse.) 

We could proceed as in section 3 and consider the states 

P z = '/> #T . z = gBeZ, (54) 



( h 9 I h g ) 



where we must now divide by since the non-unitary operator 

T(g) does not preserve norms. However, this would spoil the holomor- 
phy which we are attempting to study. Instead, proceed as follows: 
Choose an arbitrary reference point a in G c and define the holomor- 
phic coherent states 

As indicated by the notation, the right-hand side depends only on 
the coset z = gB. There is no guarantee that the denominator on the 
right-hand side is non-zero, but certainly the open set 

U ce = {geG c \ (h a \h g )^0} (56) 

(which, as indicated, depends only on the coset a = aB) contains a, 
and its projection V a to Z is an open set containing a such that xt 
is defined for all zmV a . Hence by choosing more than one reference 
point a, if necessary, we can cover Z with patches V a on which the 
X"' s are defined. 

An arbitrary vector / G TL can now be expressed as a local holo- 
morphic function 

f a (z) = ( X a z \f) (57) 
of z in V a , with transition functions 

jb, \ = (hg\h a ) ~ 

t{) (h 9 \h b ) f{Z) (58) 
= r b a (z)r(z), zeV aX a+ %. 
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The reader may wonder where this is all leading, since we are 
ultimately interested in the real group G and not in G c . Here is 
the point: It is known (Bott [1957]) that the complex homogeneous 
space Z = G c /B actually coincides with the real homogeneous space 
M = G/H used in Perelomov's construction! For example, consider 
the Weyl-Heisenberg group: G/H is parametrized by (x,p) while 
G c / B is parametrized by x — ip, and they are the same set but with 
the difference that the latter has gained a complex structure. The 
identification of M with Z in general can be obtained by noting that 
as a subgroup of G c , G acts on Z by holomorphic transformations; 
this action turns out to be transitive, and the isotropy subgroup at 
the "origin" zq = B is H, hence Z m G / H '. In other words, M 
inherits a complex structure from G c , and the natural action of G on 
M preserves this structure. Because of this, we need not deal directly 
with G c to reap the benefits of the complex structure. Let us therefore 
restrict ourselves to G. Then 

h 9 = (h a \h g } X a z = U(g)h, (59) 
hence || h g \\ = \\ h\\ =1 and the state corresponding to h g is 

\h g )(h g \ =\{K\h g )\ 2 \ x a z ){x a z \ 

= e-^ a) \x a z )(x a z \, 

where (f)(z, a) = — 2 In | ( h a \ h g ) | depends only on z = gH and a = aH 
and their complex conjugates (it is not analytic). Notice that in eq. 
(60), the left-hand side, hence also the right-hand side, is independent 
of a. Only the three individual factors on the right-hand side depend 
on a. This becomes important if several patches are needed to cover 
Z, since it means that we can change the reference point without 
affecting the smoothness of the frame. With the above definitions, 
the resolution of unity derived in section 3 becomes 

/ d m (z) e-^\ X a z )(x a z \ =1, (61) 

J M 

where we have assumed for simplicity that a single chart suffices. 
If more than one chart is needed, partition M as a (disjoint) union 
U n M n , where each M n is covered by a single chart. Since, by the above 
remark, the integrand is independent of a, the corresponding integrals 
I n form a partition of unity in the sense that Yl n = I- Therefore, 
the holomorphic coherent states form a tight frame which we call the 
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holomorphic G-frame associated with the representation U and the 
lowest-weight vector \. In this connection, note that a choice of lowest 
weight actually determines the representation U up to equivalence, 
hence a more economical terminology would be to call the above the 
holomorphic G-frame associated with x- The corresponding inner 
product is given by 



In the case G = W, since Z = (E, everything can be done globally: 
Just one chart is needed, and the reference point a can be fixed once 
for all. Taking a = 1 (the identity element of G) and a = 0, we find 
that the x^'s reduce to the canonical coherent states and 



Hence in the general case, takes the place of the Gaussian weight 
function: it corrects for the fact that holomorphic translations do not 
preserve the norm. The action of G on the space of local holomorphic 
functions f a (z) has a "multiplier": 



(/i|/ 2 )= / d m (z) f?(z) fc(z). (62) 



J M 



e^ (2 ' 0) = |(x,|xo) 



zz/2 



(63) 



(x a z \U(gi)f) 



(h g \U( 9l )f) 
(h g \h a ) 




(64) 



where z = gH, a = aH and 



l{z,g u a) 




(65) 



is holomorphic in z and anti-holomorphic in a. 

We have mentioned that M inherits a complex structure from 
G c . Actually, this is only part of the story. Let d and d denote the 
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external derivatives with respect to z and z, respectively, i.e., in local 
coordinates, 

df=^-dz k , df=^dz k (66) 
oz k oz k 

(summation over k is implied). Consider the 2-form 

oj = idd(j> = i d 2 ^_ dz 3 A dz k , (67) 

OZjOZ k 

where 4> is the function in the exponent of the weight function above. 
Theorem. 

(a) uj is closed, i.e. du = du = 0, 

(b) u> is independent of the reference point a, 

(c) uj is invariant under the action of G, and 

(d) uj is non-degenerate, if A is non-singular. 

Proof. We prove (a), (b) and (c). For the proof of (d), see Onofri 
[1975]. 

(a) follows from the fact that d+d = d is the total exterior derivative, 
hence the identity d 2 = implies 

d 2 = 0, B 2 = 0, 3d + dd = 0. (68) 

(c) follows from the fact that 7(2,(71,0;) is holomorphic in z, hence 
I7I 2 is harmonic and dd\"f\ 2 = 0. But eq. (65) shows that 

7(z,0i,a) = — ^— , 69 

exp[— (p(z, a)\ 

which implies that the pullback g*u of u under gi equals u, i.e. that 
uj is invariant. 

(b) follows from (c) and (p(z,gia) = ^(g^ 1 z,a). I 

The property (b) implies that u is defined globally on Z, whereas 
(a) and (d) mean that u is a symplectic form (Kobayashi and Nomizu 
[1969]) on Z, which makes Z a possible classical phase space. Finally, 

(c) means that the symplectic structure defined by uj is G-invariant, 
hence G acts on Z by canonical transformations. (Actually, the 2- 
form u> together with the complex structure define a Kahler structure 
on Z, i.e. a Hermitian metric such that the complex structure is 
invariant under parallel translations.) 
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3.5. The Rotation Group 

A simple but important example of the foregoing methods is provided 
by their application to the three-dimensional rotation group 50(3). 
The resulting frame vectors are known as spin coherent states. An 
excellent and detailed account of this is given in Perelomov [1986]; the 
treatment here will be fairly brief. SO (3) is locally isomorphic to the 
group SU(2) of unitary unimodular 2x2 matrices, which we denote 
by G in this section. This is the set of all matrices 

»=(°j £). M' + I0' = i, (i) 

hence G ~ S 3 (the unit sphere in H 4 ) as a manifold. The Lie algebra 
g has a basis { Ji, J 2 , J3} satisfying [J 1; J 2 ] = iJs plus cyclic permu- 
tations (where, as usual, it is actually iJk which span the real algebra 
g) which can be conveniently given as Jk = (l/2)<7fc in terms of the 
Pauli matrices 



Root vectors in g c are given by J± = J\ ± i J 2 , satisfying 

[J 3 ,J±] = ±J±, [J+,J_] = 2J 3 . (3) 
The vectors {J3, J±} form a complex basis for g c , with 

'-(Si)- H-)- w 

Unitary irreducible representations of G are characterized by a single 
number (highest weight) s = 0, 1/2, 1, 3/2, . . ., with the representation 
space having dimensionality 2s + 1. The generators Jk are repre- 
sented by hermitian matrices Sk satisfying the irreducibility (Casimir) 
condition 



s 2 = sf + s 2 2 + si = (i/2){s+s- + ss+) + si = s ( s + 1). (5) 



A basis for H is obtained by starting with a highest-weight vector v, 
i.e., 
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S+v a = 0, S 3 v s = sv s , (6) 

and applying S- repeatedly until the commutation relations imply 
that the resulting vector vanishes. This results (after normalization) 
in an orthonormal basis {v s , i> s _i, . . . , V- s } satisfying 

mv m 

y/(s - m)(s + m + 1) v m+1 (7) 

a/(s + m)(s - m + 1) Vm-i. 

To build a homogeneous frame as in section 3.3, we use a decomposi- 
tion of G in terms of Euler angles, 

g(4>, 9, ip) = exp(-z</>J 3 ) exp(-i<9J 2 ) exp(-^Js), (8) 

with < 4> < 27T, < 9 < tv and < ip < 4/r, which gives a 
corresponding decomposition of U(g) as 

U((f),6,ip) = exp(— i(j)Ss) exp(—i0S2) exp(— ii^S^). (9) 

If 2s is odd, (j), 9 and ift have the same ranges as before; if 2s is even, 
then ^ + 2tt and ^ give the same operators, hence < ^ < 2tv. If 
we choose one of the basis vectors v m as our initial vector h, then the 
stability subgroup is H = {g(0,0,ijj)} ~ S . The homogeneous space 
G/H ~ 5' 3 /5' 1 is parametrized by (4>,9), or by the unit vectors n = 
(cos (j) sin 9, sin sin 9, cos 0), hence G/if can be identified with the 
unit sphere S 2 . Choosing the section a: S 2 — > G as cr(0, 0) = (0, 9, 0), 
we obtain the frame vectors 

h n = e~ % ^ e- t6S2 h. (10) 
The G-invariant measure on S 2 is just the area measure 

dn = sin9d9d(t>, (11) 

and the tight frame is 

/ dn\h n )(h n \ = cl (12) 
Js 2 

for some number c. To find c, take the trace of both sides and use the 
fact that \h n )(h n \ is a rank-one projection operator (and hence its 
trace is unity). This gives 



SzV m = 
S+V m = 
S-V m = 
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47r = cTr/ = c(2s + l), (13) 
hence the resolution of unity is 
2s + 1 

l S 2 



[ dn\h n }(h n \ =1. (14) 
Js 2 



4n 

The overlap between frame vectors can be shown to be 

1 , /\ 2s 

1 + n • n 



\(hn\h n ,}\'=^— j , (15) 

hence they are orthogonal if and only if n' = — n. 

To construct a holomorphic frame, we must consider the complex 
Lie algebra g c and its Lie group G c = SL{2, (E ) of unimodular 2x2 
complex matrices 

9= (° j), aS-P 1 =l. (16) 

The Lie algebra h of the subgroup H of G used above is a Cartan 
subalgebra of g, and h c = (DJ 3 . The corresponding root-space de- 
composition is 

g c = h c + n+ + n- =(DJ 3 + (DJ + + (DJ_, (17) 

and this yields a Gaussian decomposition of (almost all) elements of 
G c as products of lower-triangular, diagonal and upper-triangular 
matrices: 

<7(C,d,£) = e CJ - e 2dJ *eZ J + 

1 0\ / e d \ f 1 £ 



C I J V e~ a J V 1 ' ^ 



= c- d e+- 

If we write A^ ± = exp(n ± ), then the above decomposition is G c ~ 
N~ H C N + . Comparison with the original form of g gives 

a = e d , [3 = e d £, 

(19) 

7 = Ce d , d = Ce d i + e~ d . 
We call C(fi')? £(#) an d e d = a((7) the Gaussian parameters of g. Thus 
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a(g) = a, ((g) = 7/«, Z(g) = P/ot. (20) 



Remarks: 

1. Matrices with a = 0, i.e. of the form 



(3 

-/r 1 s 



(21) 



clearly do not have this decomposition. They form a 2-dimen- 
sional complex submanifold of G c , hence have (group-) measure 
zero. 

2. Elements of G are distinguished by 5 = a and 7 = — f3, which 
implies 

aa = (1 + CC) _1 = (1 + «) _1 - (22) 

The Borel subgroup discussed in section 3.4 is B = H C N + and 
consists of all matrices of the form 

»=(S (23) 

Its cosets (-B can therefore be parametrized by £ G (D. The unattain- 
able matrices with a = form a single coset, corresponding to ( = 00. 
Hence the homogeneous space G c /B is the Riemann sphere: 

Z = G c /B « (C U {cx)} « S 2 , (24) 

in agreement with our earlier G/H = S 2 but with an added complex 
structure, as claimed in section 3.4. The action of G c on Z is as 
follows: If g('_B = C"5, i.e. 

then 

= = (26) 

That is, G c acts on 2 by Mobius transformations. 

We defined /i 5 = T(g)#h, where T(g) was the analytic continua- 
tion of U(g) to G c and T(#)# = (T(#)*) _1 • But 
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T(C-dt + )*=T(C + )*T(d)#T(t-)# 

= exp(-C-S , +)exp(-2dS , 3 )exp(-eS , _). 

Hence the unique state left invariant by B is that corresponding to 
the vector of lowest weight, h = V- s . For this choice, we get 

h g = e 2ds exp(-CS+) h = e 2ds h c . (28) 

The holomorphic coherent states with respect to the reference point 
<7i G G c are then 

gi _ h 9 _ e~ 2sdl h c 

x < ~ (h 91 \h g ) ~ (h Cl \h c v 111)1 



with 



(h Cl \h c ) = (h\eM-dS-) eM-(S+)h). (30) 



To evaluate this, express exp(— (iS-) exp(— £5+) in the opposite fac- 
torization N + H C N~ and use S-h = 0. It suffices to do the compu- 
tation at the level of 2x2 matrices since the result depends only on 
the commutation relations, which are preserved by the representation. 
Thus 



-(; ?)(■: .-"<■)(? :)■<!*<•- 

which gives 

e- d ' = l + CiC, e- d 'C = -Ci, i'e- d ' = -l (32) 

Hence 

(h Cl \h c ) = (h\T(<;' + d'£)h) 

= e--' = (l + OC) 2s , ( ' 



and 
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Xf = e" 2sdl (1 + CiCT 25 V (34) 

Since a single chart covers all of Z S 2 except for the point at £ = oo, 
just one reference point gi is needed in this case. The simplest choice 
is <7i = 1, the identity of G, which gives 

X\ = h. (35) 
With this choice, the weight function is 

e-*<« = \{h\h g )\\ g = C-di + 

= e 2 < d+ ^\(h\h c )\ 2 (36) 

_ e 2s(d+d) 

But for g G G we have the constraint 

e 2 8 (d+d) = | Q |2 = + (37) 

hence 

e-^) = (l + CC)- 2s - (38) 

To find the invariant measure on G c /B, recall that the 2-form uj = 
idd<p is invariant under G c and non-degenerate. Hence it defines an 
invariant measure, once we choose a positive orientation on G c /B. 
An easy computation gives 

u = -2zsd<91n(l + CC) 

2isdC A d( (39) 



(1 + CC) 2 



^ I 7T d ^^\h){h\=cI (40) 



Thus 



f(D (l + CC) 2 S +2 

for some c. Taking the trace and using 

(h c \h c ) = (l + (0 2s (41) 

gives, with £ = re* e , 
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f rdv 

8 ™/ o (TT7^ = c(2s + 1) ' (42) 

from which c = 4ns/ (2s + 1). Thus we have the resolution of unity 

where <i 2 £ is now Lebesgue measure on (E. 

What do the functions f(() = (h^ \ f) look like? Consider the 
vectors 

u n =^(-S+) n h, n = 0,1,2,-... (44) 
n! 

Then -u = h,u±, . . . ,U2 S are linearly independent and U2 S +i = 0, 
since Sj s+1 = 0. Thus 

h c = e -~ cs +h , s 

- - (45) 

= u + ( Ul + ■■■ + (() 2s u 2s , 

hence 

/(C) = /o + C/i + "- + C 2 72 S , (46) 

where f n = (u n \ / ). Thus, /(C) is a polynomial of degree < 2s in £. 

How are the two sets of frame vectors h n and related? It 
turns out that n is related to ( by stereographic projection of the 
2-sphere onto the complex plane. The exact relation depends on the 
particular factorizations of G used to construct the frames. In the 
case of /i n , this was the Euler angle decomposition, whereas for it 
was the Gaussian decomposition. However, there is an intrinsic way 
of relating the two sets of vectors, which goes as follows: Consider the 
functions 

*<0 = TW' (47) 

which are the expectations of the observables Sk in the state of h^, 
and which correspond to the components of the classical angular mo- 
mentum of a system whose only degrees of freedom are the rotational 
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motions given by G. Sk(C) is a quantum-mechanical version of a 
statistical average. Since = ex.p(—(S + )h, we have 



s + (0 = -^Hk\k 

2s( 



(48) 



1 + CC 



To find 5 3 (C), note that [S 3 , S+] = S+ implies [S 3 , S£] = nS$, hence 



S 3 ,e 



and 



S 3 h c = -(S+h c + e-t s +S 3 h 
= - {(S+ + s) h c . 



(49) 



(50) 



Hence 



S 3 (() = -s -(§+(() = s 



CC-i 
CC + i 



(51) 



The equator of S 2 corresponds to S3 = 0, hence to \(\ = 1, and the 
south pole (S 3 = —s) and north pole (S3 = s) to ( = and ( = 00, 
respectively. The above equations imply that 



s 2 (c) = Si(C) 2 + &(C) 2 + &(C) 2 
= \s + (0\ 2 + s 3 (0 2 



(52) 



thus S(C) belongs to the 2-sphere of radius s centered at the origin. In 
fact, the correspondence ( <-> S(C) is a bijection if we include ( = 00. 
The relation 



s+(C) 

*-&(C) 



= -c 



(53) 



shows that — ( is just the stereographic projection of S(£) from the 
north pole to the complex plane tangent to the south pole. We could 
choose s as a new independent variable ranging over the 2-sphere of 
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radius s minus the north pole and define h s = h^, where ( is the unique 

point with S(£) = s. Then the vectors h s are essentially equivalent to 
the fin's. Note that they are eigenvectors of the operator s • S with 
eigenvalue s 2 , i.e., 

s • Sh s = s 2 h s , (54) 

since the equation obviously holds for s = (0,0, — s), hence for all s 
by symmetry. 



3.6. The Harmonic Oscillator as a Contraction Limit 

In section 3.4, we gave a heuristic argument suggesting that the Weyl- 
Heisenberg group W is a "contraction limit" of 577(2). This can 
now be made precise and given a geometric interpretation at the rep- 
resentation level. Furthermore, imitating the analysis of the non- 
relativistic limit of Klein-Gordon theory (next chapter) , we shall gain 
an understanding of the relation between the Harmonic Oscillator and 
the canonical coherent states in the bargain. This connection between 
the rotation group and the harmonic oscillator has some potentially 
important applications in quantum field theory, which I hope to ex- 
plore in the future. 

We will study the limit of the representation of G = 577(2) with 
spin s as s — > oo. Since s is now a variable, we denote the represen- 
tation space by Ji s and the holomorphic coherent states by h^. The 
matrices representing the generators Jk will be denoted by 5|. By 
way of motivation, compare the resolution of unity on Ti s , 

^i^(i + ccr 2 '- 2 i^>w (i) 

with that on the representation space TC of W in terms of the canonical 
coherent states, 

h l d2ze ~~ ZZ/2 \ Xz){Xz \ =L (2) 

Note that if we set 



(3) 
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and define x% = hs, then the resolution of unity on H s becomes 



2s + i r ( zz \-( 2s + 2 ) 

" ! ' ' d 2 z 1 + ^^ \XI)(X S A =!■ (4) 



4tt(s + 1) Jq V 2(2s + 2) 

If we now take the formal limit s — > oo, this coincides with the reso- 
lution of unity on 7i, provided we can show that xt ~^ Xz- Our task 
is now (a) to find the sense in which this limit is to be taken, (b) to 
show that the generators Sf. of g go over to the generators A, A* and 
/ of w and (c) to show that the coherent states xt g° over to the 
canonical coherent states Xz- 

To properly study the limit s — > oo, we will first of all imbed all 
the spaces H s into Tt, so that the limit may be considered within 7i. 
This is done most easily by using the orthonormal bases obtained by 
applying S+ and A* to the respective "ground states" . An orthonor- 
mal basis for H is given by 



w r 



= (2 n niy 1/2 (A*) n Xo , n = 0,1,2,..., (5) 



where Axo = and the normalization is determined by the commu- 
tation relation [A, A*] = 21. The generators A and A* act by 



(6) 



Aw n = v2nw„_i 
A*w n = V2n + 2w n+ i. 
An orthonormal basis for TC S is given by 



^^W^' n = 0,1, 2,..., 2s, (7) 

where S^_h s = 0. We imbed H s into H by identifying with w n and 
defining Sf.w n = for n > 2s. Then for < n < 2s, 

S^_w n = a/ n(2s — n + 1) w n -i 

S s + w n = y/(n + l)(2s-n) w n+1 (8) 

S^w n = (n - s)w n . 

To see how the generators must be scaled, note that the relation 
between ( and z implies that 



84 



3. Frames and Lie Groups 



h 8 c = exp(-CS;)/i s = exp(zK+/2)w = xl (9) 

where 

= vki- (10) 

Define K s _ = S s _/yfs + l, so that K s _ = {K%)\ and 

KS = -\K1,K S ] = (11) 

3 2 L + > -J s + 1 V ) 

Then for < n < 2s, 



n(2s-n + l) 

K s _w n = \l -i — ^ w n _i 

s + 1 



(n + l)(2s-n) (i2) 

K + Wn = v — iTi — 

^3™n = — 7««n- 

If we now take the limit s — > oo while keeping n fixed, we obtain 

K s _w n — > v^2nty n -i 

iq.w; n -> V2?i + 2«; n+ i (13) 
K|w n -> -w n . 

Comparing this with the action of A and A*, we see that 

K s _ — > A 

-> A* (14) 
tf| - -J 

as s — > oo in the weaic operator topology of 7Y. (These limits are not 
valid in the strong operator topology since it is necessary to keep n 
fixed.) Thus we have shown that in the weak sense, the representation 
of SU(2) goes over to a representation of W as s — > oo. Note that the 
operator 

N s _ = S s 3 + s, (15) 
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which satisfies 

N!Lw n = nw n , < n < 2s, 



(16) 

N^_w n = sw n , n > 2s, 



has the weak limit 



w-limATl =N= -A* A, (17) 

which is the Hamiltonian for the harmonic oscillator (minus the 
ground-state energy) . If we write A = X + iP with X and P self- 
adjoint, then 

N = - (X 2 + P 2 - I) . (18) 
The fact that — > A* means that 

X s z = exp {zK s + /2) w — > exp(^72)«;o = Xz, (19) 

where Xz are the canonical coherent states and the convergence is in 
the weak topology of H. 

We now examine the limit s — > oo from a global geometric point 
of view, using the coherent states. Recall that the expectation vector 

(h s t \S s h s t ) 

SS(C) s w < 20 » 

ranges over the sphere of radius s centered at the origin, with the 
north pole corresponding to ( = oo. The transformation from Sf. to 
Kl deforms this sphere to an ellipsoid, 



1*1(01 



2 



+mo 2 = (— t) • (2i) 



s + l V s + 1 

When s — > oo, this ellipsoid splits into the two planes — > ±1. 
Our weak limit — > —7 only picked out the lower plane. We could 
have picked out the upper plane by imbedding H s into H differently, 
namely by identifying W2 S -n with w n for n = 0, 1, 2, ... , 2s. In that 
case we would have obtained the weak limits 
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K s _ -> A* 

K s + — > A (22) 

L 3 



In terms of the coherent states, this corresponds to using the highest- 
weight vector instead of the lowest-weight vector or, equivalently, 
using a chart centered about the north pole rather than the south 
pole, for example, by using as reference point the element g\ = e~ l7rj2 . 
The corresponding harmonic-oscillator Hamiltonian is the weak limit 
of the operator 

N s + = s-S s 3 =2s-N s _. (23) 
The expectation values of Ni and N+, 

2<C 



ivi(0 



+y^i 1 + cc 



are related by inversion: 



NUO = NL(l/0- (25) 

The splitting of the ellipsoid into the two planes can be under- 
stood from the point of view of representation theory by writing the 
irreducibility condition in terms of the ICs: 

2^ ( K + K - + K - K +) + ^ = iU 1 - (26) 

When s — > oo, this implies formally that (7T|) — > 7. The subspaces 
H± on which 7C| — > ±7 are invariant in the limit, hence the limiting 
representation of W is reducible. Evidently, by taking the limits in the 
weak topology we are able to pick out just one irreducible component 
at a time. 

There is an interesting analogy between the above analysis and 
the non-relativistic limit of Klein-Gordon theory, which we now point 
out for those readers already familiar with the latter. (The non- 
relativistic limit will be discussed in the next chapter.) The relation 
Ss/(s + l) — > ±7 corresponds to Po/mc 2 — > ±7 in the limit mc 2 — > oo, 
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where Pq is the relativistic energy operator. As will be seen in the 
next chapter, the Poincare group contracts, in this approximation, to 
a semidirect product of the 7-dimensional Weyl-Heisenberg group and 
the rotation group. A first-order correction is obtained by expanding 



P = ±Vm 2 c 4 + P 2 c 2 ~ ± (mc 2 + P 2 /2m) = ± (mc 2 + H) , (27) 

where H is the non-relativistic free Schrodinger Hamiltonian. Simi- 
larly, it follows from 



1 



(S s + S s _ + S s _Sl) + (SI) 2 = s(s + 1) 

1 



" (28) 



(S s + S 8 _ -S s _Sl) - SI = 
that 



(S 3 S - hf = (s + i) 2 - S' + S'_, (29) 



hence formally 



S s S s (30) 



2s + 1 
from which 



S?-> ~ -a + -K> + KL, 

2 (31) 

^ 3 S(+) ~s + l--K s + K s _, 

which corresponds to Pq ~ ±(mc 2 + H). The analogy between 
the large-spin and the non-relativistic limits can be summarized as 
follows: The Poincare group corresponds to SU(2) : the energy P to 
S3, the rest energy mc 2 to — s, and the non-relativistic Hamiltonian H 
to the harmonic oscillator hamiltonian N . The analog of the central 
extension of the Galilean group (which is the contraction limit of the 
Poincare group, as discussed in the next chapter) is the Oscillator 
group (Streater [1967]), whose generators are A, A*, I and N, with 
the commutation relations 
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[N, A] = -A, [N, A*} = A*. (32) 

Finally, we note that the above analysis explains a well-known 
relationship between the harmonic oscillator and the canonical coher- 
ent states, namely that the latter evolve naturally under the harmonic 
oscillator dynamics. We first derive the corresponding relation within 
SU{2), i.e. for finite s. Consider the behavior of under the "evo- 
lution operator" exp(— itNt)- We wish to express 

e- itN - X s z = e- lts e- us "e-^ s +h s (33) 

in terms of the coherent states x%i hence we need to display the op- 
erator on the right in the reverse Gaussian form 

exp(-C'S;) exp(2d , 5 3 ) exp(£'S_). (34) 
As explained earlier, it suffices to do the computation on 2x2 matrices: 

6 6 " ^ e^ 2 ){0 1 ) 

= {0 1 ){0 e- d '){e l) 
This gives d! = -it/2, (' = e u ( and = 0. Hence 

e- itN3 x s z = e- its e-? s le- its sh 
= e -~ c ' s +h = h c 

= X z {t)i 

where z(t) = e lt z. (This is intuitively obvious, since exp(— itS^) ro- 
tates the 1-2 plane clockwise by an angle £, hence it rotates the co- 
ordinate z counterclockwise by an angle t.) In the limit s — > oo, this 
gives the well-known result (Henley and Thirring [1962]) that the set 
of canonical coherent states is invariant under the harmonic oscillator 
time evolution, with individual coherent states moving along the clas- 
sical trajectories z(t) determined by the initial conditions z = x — ip in 
phase space. The above shows that the same is true within SU (2), i.e. 
for finite s, where it is a consequence of the fact that Ni is essentially 
the generator of rotations about the 3-axis. 

Note: After this section was written, I learned from R. F. Streater 
that a related construction was made by Dyson [1956] . 



(35) 



(36) 
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4.1. Introduction 

Relativistic quantum mechanics is a synthesis of special relativity and 
ordinary (i.e., non-relativistic) quantum mechanics. The former is 
based on the Lorentzian geometry of spacetime, while the latter is 
usually obtained from classical mechanics by a somewhat mysterious 
set of rules known as "quantization" in which classical observables, 
which are functions on phase space, suddenly become operators on 
a Hilbert space. Classical mechanics, in turn, can be formulated in 
terms of Newtonian space-time (the Lagrangian approach) or it can 
be based on the symplectic geometry of phase space (see Abraham 
and Marsden [1978]). The latter, called the Hamiltonian approach, 
is usually considered to be deeper and more powerful, and its study 
has virtually turned modern classical mechanics into a branch of dif- 
ferential geometry. Yet, the standard formalism of relativistic quan- 
tum mechanics rests solely on the geometry of spacetime. Symplectic 
geometry, so prominent in classical mechanics, seems to have disap- 
peared without a trace. 

In this chapter we develop a formulation of relativistic quantum 
mechanics in which symplectic geometry plays an important role. This 
will be done by studying the role of phase space in relativity and dis- 
covering its counterpart in relation to the Poincare group, which is 
the invariance group of Minkowskian spacetime. It turns out that the 
Perelomov construction fails for relativistic particles (the physically 
relevant representations are not square-integrable), and an alterna- 
tive route must be taken. The result is a formalism based on complex 
spacetime which, we show, may be regarded as a relativistic extension 
of classical phase space. As a by-product, two long-standing inconsis- 
tencies of relativistic quantum mechanics (in its standard spacetime 
formulation) are resolved, namely the problems of localization and 
covariant probabilistic interpretation. Rather than being sharply lo- 
calizable in space (which leads to conflicts with causality; see Newton 
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and Wigner [1949] and Hegerfeldt [1985]), particles in the new formu- 
lation are at best softly localizable in phase space. This is just a covari- 
ant version of the situation in the coherent-state representation. But 
whereas for non-relativistic particles both the Schrodinger represen- 
tation and the coherent-state representation give equally consistent 
theories, the spacetime formulation of relativistic quantum mechanics 
is inconsistent because it lacks a genuine probabilistic interpretation, 
a situation remedied by the phase-space formulation (section 4.5). 



4.2. Relativity, Phase Space and Quantization 

At first it appears that phase space and spacetime are mutually ex- 
clusive: The phase-space coordinates of a particle are in one-to-one 
correspondence with the initial conditions which determine its classi- 
cal motion, i.e. its worldline. Hence the phase space can be identified 
with either the set of all initial conditions or the set of worldlines of 
the classical particle. In either case, time is treated differently from 
space: For initial conditions, an arbitrary "initial" time is chosen; 
for world-lines, the dynamical "flow" is factored out. Furthermore, 
locality in spacetime is lost in either case. 

In this section we confine ourselves to the physical case s = 3, i.e., 
three-dimensional space. Our approach will be to leave spacetime 
intact and, instead, consider its group of symmetries, the Poincare 
group, which is defined as follows: Let it be a four-vector. We denote 
its time component (with respect to an arbitrary reference frame) 
by u° and its space components by u = (it 1 , w 2 , k 3 ). The invariant 
Lorentzian inner product of two such vectors is defined as 

(it, v) = uv = c 2 u°v° — u • v = g^ u u^v u '. (1) 

Later we will set c = 1 (which amounts to measuring time as the 
distance traveled by light), but for the present it is important to 
include c, since the non-relativistic limit c — > oo will be considered. 

The Lorentz group C is the set of all linear transformations 
A : IR 4 — > IR 4 which leave the inner product invariant: 

(Aw, Av) = (u, v) Vu, v G M 4 . (2) 

C includes transformations which reverse the orientation of time 
and ones which reverse the orientation of space. Such space- and time 
reflections split C into four connected components. The component 



4.2. Relativity, Phase Space and Quantization 



91 



connected to the identity (whose elements reverse neither the orien- 
tations of time nor of space) is called the restricted Lorentz group 
and denoted by Co- If we let u 4 = cu°, so that uv = —u ■ v + u 4 v 4 , 
we can identify Co with 50(3, 1) + (the plus sign indicates that the 
orientation of time, hence also of space, is preserved separately.) The 
Poincare group V is defined as the set of all Lorentz transformations 
combined with spacetime translations: 

V = {(a,A)|A G C,a G M 4 } (3) 
where (a, A) acts on IR 4 by 

(a, A)u = Au + a. (4) 

We will be dealing with the restricted Poincare group Vo where 
A G Co- The reason for our interest in Vo is that it parametrizes all 
reference frames which can be obtained from a given reference frame 
by a continuous motion. (By "motion" we mean any spacetime trans- 
lation, rotation or boost, including physically impossible "motions" 
such as space-like translations and translations backwards in time.) 
Now to specify a reference frame (a, A) relative to some fixed refer- 
ence frame (located, say, at the origin in spacetime), we must give 
its origin (namely, a), its velocity and its spatial orientation (all 
relative to the fixed frame). Thus if we ignore the spatial orientation 

by factoring out the rotation subgroup SO (3) of C\ , the resulting 
seven-dimensional homogeneous space 

C = V /SO(3) (5) 

can be identified with the set of positions in spacetime (events) to- 
gether with all possible velocities at these events. The set of all 
(future-pointing) four-velocities is a hyperboloid 

0+ = {u G IR 4 | u 2 = (u, u) = c 2 , u° > 0}. (6) 

Thus 

C^IR 4 xO+, (7) 

which is an extension of classical phase space obtained by including 
time along with the space coordinates. Such an object is usually called 
space. Strictly speaking, C is an extended velocity phase space 
rather than an extended momentum phase space; this appears to be 
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the price for retaining locality in spacetime. What matters for us 
is that it will have the required symplectic (more precisely, contact) 
structure. From its construction, it is clear that C is a relativistically 
covariant object; it is not invariant since the choice of SO (3) in Vo 
is frame-dependent. We will see that C combines the geometries of 
spacetime and phase space in a natural way. 

Thus, rather than conflicting with relativity, the concept of phase 
space actually follows from it: Appending time to the geometry of 
space means appending velocity-changing transformations (which are 
just rotations in a space-time plane) to the group of rigid motions, 
hence the enlarged group contains velocity coordinates in addition to 
space coordinates. In a sense, Vo itself is actually a "super phase 
space" since it furthermore contains information on the spatial orien- 
tation, which is needed to include spin degrees of freedom along with 
the translational degrees of freedom. Vo even has a natural general- 
ization to the case of curved spacetime, namely the frame bundle of 
all "orthonormal" frames (with respect to the given curved metric) 
at all possible events. (For the definition and study of frame bundles, 
see Kobayashi and Nomizu [1963, 1969].) 

In our review of canonical coherent states (chapter 1), we saw that 
the classical phase space resulted from the Weyl-Heisenberg group W, 
which, unlike Vq was not a symmetry group of the theory but merely 
a Lie group generated by the fundamental dynamical observables of 
position and momentum at a fixed time. We will now see that W 
is related to the non-relativistic limit of Vo in two distinct ways: as 
a normal subgroup, and as a homogeneous space. This insight will 
play a key role in generalizing the canonical coherent states to the 
relativistic case. It turns out that the role of W as a group has no 
relativistic counterpart, whereas its role as a homogeneous space does: 
its relativistic generalization is C. 

Consider the Lie algebra p of Vo, which is spanned by the gener- 
ators Pk of spatial translations, Po of time translations, J k of spatial 
rotations and K k of pure Lorentz transformations (the "boosts"). The 
Lie brackets of p are given by 

[Jj, Jk] = iJi [Jj, K k ] = %Ki 

[P ,K r ] = iP r [J J ,P k ]=iP l (8) 

[Kj, K k ] = -ic- 2 J t [P r , K s ] = ic- 2 5 rs P 

where (j,k,l) is a cyclic permutation of (1,2,3), r,s = 1,2,3 and all 
unspecified brackets vanish. The physical dimensions of these gener- 
ators are as follows: Pq is a reciprocal time, P k is a reciprocal lenght, 
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Jfc is dimensionless (reciprocal angle) and Kk is a reciprocal velocity. 

Notice that so far nothing has been said about quantum me- 
chanics, p is simply the Lie algebra of the infinitesimal motions of 
Minkowskian spacetime, a classical concept. (The unit imaginary % in 
eq (9) can be removed by replacing with —iP^, Jfc with —iJk and 
Kk with —iKk] i is included because we anticipate that in quantum 
mechanics these generators become self-adjoint operators.) Quantum 
mechanics is now introduced through the following postulate: 
(Q). The formalism of relativistic quantum theory is based on a uni- 
tary (though possibly reducible) representation of Vq. 
That is, the representation provides the quantum-mechanical Hilbert 
space, and the generators of p, which by unitarity are represented by 
self-adjoint operators, are interpreted as the basic physical observ- 
ables: Po as the energy, Pk as the momentum and Jfc as the angular 
momentum. One may then consider perturbations by introducing in- 
teractions or gauge fields. In fact, the assumption (Q) is very general 
in scope; it serves as one of the axioms in axiomatic quantum field 
theory (Streater and Wightman [1964]). Unlike the usual prescription 
of "quantization" in non-relativistic quantum mechanics, (Q) is both 
mathematically and physically unambiguous. Yet, (Q) implies and, 
at the same time, supercedes the canonical commutation relations! 
To see this, consider the non-relativistic limit c — > oo of p. Letting 

c~ 2 P = M, (9) 

we see that in the limit c — > oo, p "contracts" to a Lie algebra gi 
with generators M, Pk, Jk, Kk satisfying 

[Jj, Jfc] = iJi [Jj, K k ] = iKi 

[M,K r }=0 [J J ,P k ]=iPi (10) 

[K jt K k ] = [P r ,K s ] = i5 rs M 

and all other brackets vanishing. Note that (a) M is a central element 
of gi and (b) M, Pk and Kk span an invariant subalgebra w of gi 
which is isomorphic to the Weyl-Heisenberg algebra, with M playing 
the role of the central element E. Hence if Q\ denotes the connected, 
simply connected Lie group generated by gi, then the invariant sub- 
group of Q\ generated by w is isomorphic to the Weyl-Heisenberg 
group W. The remaining generators Jfc of gi span the Lie algebra 
so(3) of the spatial rotation group SO(3), so Q\ is the semi-direct 
product of W with SO (3) : 



Q l = W®SO(3). 



(11) 
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Now suppose that the unitary representation of V\_ in assumption 
(Q) is irreducible. [Assumption (Q) means that we are dealing with 
a general quantum system, possibly a system of interacting particles 
or even quantum fields; it is the additional assumption of irreducibil- 
ity which makes this system elementary, roughly a single particle. 
Hence the concept of position, discussed below, is only now admissi- 
ble.] Assuming that the formal limit c — > oo of Lie algebras rigorously 
induces a corresponding limit at the representation level (and this is 
indeed the will be shown later), assumption (Q) implies 

that we have a unitary irreducible representation of Q\ in that limit. 
Then M, P k and K k are represented by self-adjoint operators on some 
Hilbert space, which we will denote by the same symbols. Irreducibil- 
ity implies that the central element M has the form ml, where m is a 
real number and I denotes the identity operator. Assume m > (this 
is physically necessary since m will be interpreted as a mass) and let 

X k = -(l/m)K k . (12) 

Then eq. (10) shows that X k and P k satisfy the canonical commuta- 
tion relations: 

[X r ,P s )=i8 rs I, (13) 

thus X k behaves like a position operator. This shows that the assump- 
tion (Q), which is conceptually simple, mathematically precise, rela- 
tivistically invariant and very general, actually implies the much less 
satisfactory "quantization" prescription in the non-relativistic limit, 
under the additional assumption of irreducibility. 

How does it happen that classical relativistic geometry, as repre- 
sented by Vo, when combined with assumption (Q), yields the mys- 
terious canonical commutation relations? To understand this, note 
that eq. (10) came from the relativistic Lie bracket 

[P r ,K s ] = ic- 2 5 rs P (14) 

which states that boosting (accelerating) in any given spatial direc- 
tion does not commute with translating in the same direction. This, in 
turn, is a consequence of the fact that Einsteinian space is not absolute 
since in the accelerated frame there is a (Lorentz) contraction in the 
direction of motion, so first translating and then boosting is not the 
same as first boosting and then translating. In the non-relativistic 
limit this gives the canonical commutation relations. No such easy 
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derivation of these relations would have been possible without in- 
voking Relativity. For had we begun with Newtonian space-time, the 
appropriate invariance group would have been not Vq but the Galilean 
group Q. Since Newtonian space is absolute and hence unaffected by 
boosting to a moving frame, the Galilean boosts Kk commute with 
with the Galilean generators of translations Pk , hence yield no canon- 
ical commutation relations and no associated uncertainty principle. In 
the case of Qi, the canonical commutation relations are a remnant of 
relativistic invariance. Thus the uncertainty principle originates, in 
some sense, in "classical" Relativity theory! 

Eq. (12) states that an acceptable set of position operators for a 
non-relativistic particle is given in terms of the generators of Galilean 
boosts (more precisely, the boosts of a central extension of the Gali- 
lean group, as explained below). It is interesting to see how this comes 
about from a more intuitive, physical point of view, since position op- 
erators are problematic in relativistic quantum mechanics (as will be 
discussed later) but the boosts have natural relativistic counterparts. 
To gain insight, we will now give two additional rough but intuitive 
arguments for the validity of eq. (12). 

1. For a spinless particle, the generators of the Poincare group can be 
realized as operators on a space of functions over spacetime (namely, 
the space of solutions of the Klein-Gordon equation) by 

d 

Po = i- 



dx 

Pk = - l lk (15) 

Jk %lPm %m,Pl 

K k = x P k - c~ 2 x k Po 

where (k,l,m) is a cyclic permutation of (1,2,3). In the non-relativ- 
istic limit, P — > mc 2 , so 



-(l/m)K k -» x k - x Pk/m, (16) 

which displays — (l/m)Kk as the operator of multiplication by Xk (the 
usual non-relativistic position operator) minus the distance the par- 
ticle has traveled in time t = xq while going at a velocity v = P/m. 
This is just the initial position of the particle at time t = 0; that is, the 
non-relativistic limit of —(l/m)Kk is just the (non-relativistic) posi- 
tion operator in the Schrddinger picture, where operators are constant 
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while states vary with time. 

2. The second explanation of eq. (12) begins with the relativistic 
state space C ~ H 4 x fi+ and considers the non-relativistic limit. 
As c — > oo, the hyperboloid fi+ flattens out to a (three-dimensional) 
hyperplane at infinity, the non-relativistic velocity space V ~ IR 3 . 
In that limit, the boosts K k commute and become the generators of 
translations in V. If the cartesian coordinates of V are Vk, then 

K k - -i^-. (17) 

Now the velocities v k are related to the momenta p k by v k = Pk/ m i 
where m is the mass. Thus in the limit 



K k = -mi — . (18) 
dp k 

But in the momentum representation of non-relativistic quantum me- 
chanics , the position operators are represented by 

X k = i^-, (19) 
dp k 

which again gives agreement with eq. (12). 

Now that we have an acceptable relativistic generalization of the 
classical phase space, let us return to our goal of extending the canon- 
ical coherent-state representation to relativistic particles. We have 
seen that the Weyl-Heisenberg group, on which this representation is 
based, is isomorphic to an invariant subgroup of the non-relativistic 
limit of V\. However, this subgroup is not the non-relativistic limit 
of any subgroup of Vq, since the Lie brackets of K k ,P k and P do 
not close due to 



[Kj, K k ] = -ic~ 2 J h {j, k, I cyclic ). (20) 

That is, we cannot simply generalize the canonical coherent states by 
choosing the right subgroup of Vq. However, eq. (11) shows that W 
also plays another role in the group Qi, namely as a homogeneous 
space: 



W = Gi/SO(3). 



(21) 



4.2. Relativity, Phase Space and Quantization 



97 



In this form, it does have an obvious relativistic counterpart, namely 
C. In retrospect, the role of W as a group is purely incidental to quan- 
tum mechanics , since there is no fundamental reason why the set of 
all dynamical states should form a group. On the other hand, this set 
should certainly be a homogeneous space under the group of motions, 
since this group must transform dynamical states into one another 
and this action can be assumed to be transitive (otherwise we may as 
well restrict ourselves to an orbit). In either of its roles relative to Q\, 
W acquires a slightly different physical interpretation from the one 
it had in relation to the canonical commutation relations: Since the 
group-manifold of W is generated by the vector fields Pk , Kk and M, 
the coordinates on W are the positions Xk (generated by Pk), the ve- 
locities Vk (generated by Kk) and the variable generated by M, which 
is a degenerate form of "time" inherited form Relativity through the 
limit c~ 2 P — > M. By comparison, the coordinates on the original 
Weyl-Heisenberg group were Xk (generated by Pk), the momentum 
Pk (generated by Xk) and a dimensionless "phase angle" generated 
by the identity operator. With this new interpretation, W is the 
product of velocity phase space with "time" and truly represents the 
no n- relativistic limit of C. 

We will construct a coherent-state representation of Vq by first 
discovering its non-relativistic limit. This limit will be a represen- 
tation of a quantum mechanical version Q2 of the Galilean group Q 
and will be seen to be a close relative of the canonical coherent-state 
representation of W. It is therefore necessary first to understand just 
how the group Q is related to the Poincare group Vq and its non- 
relativistic limit Q\. Again, we will do everything at the level of Lie 
algebras. There is no problem with globalization. 

The universal enveloping algebra of p contains the mass-squared 
operator 



M = c~ 4 P 2 - c-\P x l + P 2 + P3 ) = c" 4 P - c" 2 P 2 , (22) 

which is a Casimir operator, i.e. commutes with all generators in 
p. Assuming that both P and M are represented by positive oper- 
ators and that M is invertible (and this will be the case for massive 
particles), we have formally for large c: 

c~ 2 P = (M 2 + c" 2 P 2 ) 1/2 = M + P 2 /2Mc 2 + 0(c -4 ), (23) 



where we have used the fact that M commutes with Pk. The operator 
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H = P 2 /2M (24) 

is just the Hamiltonian for the non-relativistic free particle, hence 
generates its time translations and must be included in the Lie algebra 
of the Galilean group. Let us therefore try to append it to the Lie 
algebra gi of Gi- Indeed, by eq. (10), 

[H,P k }=0 [H,M]=0 

[H,J k ]=0 [H,K k ] = iP k , [ 0) 

showing that 



g 2 = Span{K fc ,P fc ,J fc ,M,tf} (26) 

actually forms a Lie algebra with Lie brackets given by eqs. (10) and 
(25). Clearly g 2 contains gi as a subalgebra. We will refer to the cor- 
responding Lie group G2 as the quantum mechanical Galilean group. 
It is the group of translations, rotations, boosts, dynamics (i.e., time 
translations) and multiplications by constant phase factors (generated 
by M) acting on the wave functions of a non-relativistic quantum me- 
chanical particle. The relation of G2 to the classical Galilean group Q 
is as follows: The subgroup generated by M, which can be identified 
with the group of real numbers IR, is central; then Q is obtained from 
Qi by factoring out IR: 



G = Gi/n. (27) 

The action of IR on quantum mechanical wave functions, which a- 
mounts to a multiplication by a constant phase factor, is a necessary 
part of Q 2 because of [P r ,K s ] = i5 rs M which, as we have seen, is 
related to the uncertainty principle. Factoring out this action means 
ignoring that phase factor, so it is reasonable that it should give the 
classical Galilean group. On the Lie algebra level, it amounts to 
setting M = 0. Had we included Planck's constant % in eq. (10), this 
would have amounted to taking the classical limit h — > 0. The above 
relation between IR, G2 and G in an example of a central extension 
(Varadarajan [1969]). One says that G2 is a central extension of IR by 
G. 

The fact that W is a subgroup of G2 was noted by Bargmann 
[1954]; that representations of G2 are contractions of representations 
of the Poincare group was shown by Mackey [1955].* 



I thank R. F. Streater for these remarks. 
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Our object in this section is to construct coherent states which are 
naturally associated with free non-relativistic particles, just as the 
canonical coherent states are associated with the Weyl-Heisenberg 
group or the Harmonic oscillator and the spin coherent states are as- 
sociated with with SU(2). An obvious starting point would be to 
apply the Klauder-Perelomov method (chapter 1) to the quantum- 
mechanical Galilean group Q2, since it is this group which describes 
such particles. However, this method fails, due to the fact that all the 
representations of physical interest are not square-integrable. There- 
fore we will follow a more pedestrian route. Our main guides will be 
analyticity (which, it turns out, follows from an important physical 
condition) and "physical intuition." 

We return to the general case where the configuration space is IR S 
instead of IR 3 . For simplicity, we restrict ourselves to spinless parti- 
cles. It is not difficult to include spin, as will be shown later. The 
states of such particles are described by complex-valued wave func- 
tions /(x, t) of position x and time t which are are square-integrable 
with respect to x at any time t. Their evolution in time is given by 
the Schrodinger equation 



where 



i% = Hf, (1) 



H = -J-A (2) 

2m v ' 



is the Hamiltonian operator, and A is the Laplacian acting on 
L 2 (IR S ). Since H is self-adjoint, though unbounded, the solutions are 
given through the unitary one-parameter group U(t) = exp(-itH): 

/(x,f) = (tf(f)/)(x) 

= (2tt)- s / cf p exp[-ztp 2 /2m + ip • x]/(p), 

JlR B 

where it is assumed that /(x, 0), hence also its Fourier transform /(p), 
isinL 2 (IR s ). 

The key to our approach will be to note that H is a non-negative 
operator, hence the evolution group U(t) can be analytically continued 
to the lower-half complex time plane (E~ as 
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U(t - iu) = exp[-i(t - iu)H] = e~ UH ' e~ uH , u > 0. (4) 

(Note also that since H is unbounded, no analytic continuation to 
the upper-half time plane is possible.) The operator e~ uH is familiar 
from two other contexts: it constitutes the evolution semigroup for 
the heat equation (where u is time), and it is also the unnormalized 
density matrix for the Gibbs canonical ensemble describing the sta- 
tistical equilibrium of a quantum system at temperature T (where 
u = 1/kT). To get a feel for our use of this operator, let us be heuris- 
tic for a moment and consider what happens when a free classical 
free particle of mass m is evolved in complex time t = t — iu. If its 
initial position and momentum are x and p respectively, then its new 
position will be 

z(t) = x + (t — iu)p/m 

= (x + tp/m) — i{u/m)p (5) 
= x(£) — i{u/m)p. 

Since x(t) is just the position evolved in real time t, we see that z(t) 
is, in fact, a complex phase space coordinate of the same type we en- 
countered in the construction of the canonical coherent states! Armed 
with this intuition, let us now return to quantum mechanics and see if 
this idea has a quantum mechanical counterpart. The operator e~ uH , 
when applied to any function in L 2 (IR S ), gives 

/ u (x) ee (e"« H /)(x) 

= (2ir)- s [ d s p exp[-wp 2 /2m + zp-x]/(p). 

If we replace x in the integrand by an arbitrary z G (E s , the integral 
still converges absolutely since the quadratic term in the exponent 
dominates the linear term for large |p|. Clearly the resulting function 
is entire in z (differentiating the integrand with respect to zt still 
gives an absolutely convergent integral). This shows that the group 
of Galilean space-time translations, 



E/(x, t) = exp(-itH + ix • P) , 



(7) 
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extends analytically to a semigroup of complex space-time transla- 
tions 

U(z,f) = exp (-if H + iz ■ P) (8) 
denned over the complex space-time domain 

V = {(z,r) |z G (D s ,r G (T }. (9) 

This translation semigroup can be combined with the rotations and 
boosts to give an analytic semigroup extending Q2- 

Let Ti u be the vector space of all the entire functions f u (z) as 

/(p) runs through L 2 (1R S ). Then 

/„(*) = <£ |/>, (10) 

where 



e£ (p) = (27r) s exp[-wp 2 /2m - ip • z] 



= (2tt) 



exp 



my 2 /2« 



it 

2m 



(p — my/u) 2 — ip ■ x 



(11) 



are seen to be Gaussian wave packets in momentum space with ex- 
pected position and momentum given in terms of z = x — zy by 



(X k )=x k and (P k ) = (m/u)y k . 



(12) 



The e^'s are easily shown to have minimal uncertainty products. The 
momentum uncertainty can be read off directly from the exponent 
and is 



A Pk = Vm/2w, 



(13) 



hence 



A Xk = y/u/2m. (14) 

We now have our prospective coherent states and their label space 
M = (E s . To construct a coherent-state representation, we need a 
measure on M which will make the e^'s into a frame. Since the e^'s 
are Gaussian, the measure in not difficult to find: 



dfi u (z) = (m/TTu) s / 2 exp (— my 2 /u) cfxcfy. 



(15) 
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Defining 

(f\9)n u = {fu\gu) = J dn u (z)f u (z)g u (z), (16) 

we have 
Theorem 4.1. 

(a) ( • | • )<n u is an inner product on 7f u under which H u is a Hilbert 
space. 

(b) The map e~ uH is unitary from L 2 (IR S ) onto H u . 

(c) The e^'s define a resolution of unity on L 2 (IR S ) given by 

[ d» u {z) |e-><e-| = 1. (17) 

Proof. We prove that || / H 2 ^ = (f\f)n u = II /Ilia- The inner 
product can be recovered by polarization. To begin with, assume 
that / is in the Schwartz space <S(IR S ) of rapidly decreasing smooth 
test functions. Then 

/ u (x - iy) = (exp[- W p 2 /2m + y • p] /) ~(x), (18) 
hence by Plancherel's theorem, 



J cfx|/ u (x-zy) 



(2tv)- s J d s pexp(-up 2 /m + 2yp) | /(p) 



(19) 



and 



J dn u {z) | f u (z) | 2 

= {m/nu) s/2 (2-K)- s J d s pexp[-up 2 /m] | /(p) | 2 x 

J d s yexp(-my 2 /u + 2y • p) 
= (2n)~ s J d'p|/(p)| 2 = ||/|| 2 , 



(20) 
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where exchanging the order of integration was justified since the in- 
tegrals are absolutely convergent. This proves (b), hence also (a), 
for / G 5(IR S ). Since the latter space is dense in L 2 (IR S ), the proof 
extends to f E L 2 (IR S ) by continuity, (c) follows by noting that 

(f\g)L* = (f\g)H u 

= J dn u (z)f u (z)g u (z) ^ 

= J dn u {z)(f\el){el\g) 

and dropping ( / | and | g ) . I 

Using the map e~ uH , we can transfer any structure from L 2 (IR S ) 
to H u - In particular, time evolution is given by 



f u (z,t)=(e- uH (e- itH f))(z) 

= (2tt)- s J d s p exp [-zrp 2 /2m + iz ■ p] /(p) (22) 

= ( e z,r I/), 
where r = t — iu and the wave packets 

e Z)T (p) = (2tt)" s exp[-f p 2 /2m - ip ■ z] (23) 

are obtained from the e^'s by evolving in real time t. They cannot be 
of minimal uncertainty since the free-particle Schrodinger equation is 
neccessarily dissipative. Instead, they give the following expectations 
and uncertainties: 

(P k ) = (m/u)y k 

( X k ) = x k - (t/u)y k = x k - (t/m) ( P k ) 
A Pk = ^m/2u (24) 



Since 




(25) 
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it follows that 



dfi u (z)\f(z,T)\ 
w (26) 

= \\e-^ 2 / 2m f\\ 2 = Il/H 2 , 

thus we have a frame { e Z)T | z e (E s } at each complex "instant" r = 
t — ut, with the corresponding resolution of unity 

/ d/j u (z) |e ZjT )(e Z;T | = I. (27) 
J <s s 

The space L 2 (IR S ) carries a representation of the quantum me- 
chanical Galilean group Qi- Since the e Z)T 's were obtained from the 
dynamics associated with this group, they transform naturally under 
its action. A typical element of Q2 has the form g = (R,v,x.o,to,6), 
where R is a rotation, v is a boost, xo is a spatial translation, to is 
a time-translation and 9 is the "phase" parameter associated with 
the central element M = m/h = m in our representation (see section 
4.2). g acts on the complex space-time domain T> by sending the 
point (z,r) to (r',z'), where 

x' = .Rx + £v + x 

y' = Ry + itv 

, (28) 
t' = t + 1 

u' = u. 

The parameter 9 has no effect on space-time; it only acts on wave 
functions by multiplying them by a phase factor. The representation 
of Q2 is defined by 

{U g f)(z,r) = e— e f{g- 1 (z,r)). (29) 

Thus we have 

U g e z , T = e ime e g(z , T) , (30) 

and the e Z;T 's are "projectively covariant" under the action of Qi\ if we 
define e Z)T) ^ = e _im< ^e Z)T , then this expanded set is invariant under 
the action of Qi-, with <p' = <p — 9. Since e Z)T and e Z)Ti ^ represent 
the same physical state, we won't be fussy and just work with the 
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e ZjT 's. Anyway, this anomaly will disappear when we construct the 
corresponding relativistic coherent states. 

The above representation of Q 2 on L 2 (IR S ) can be transfered to 
H u using the map e~ uH . This map therefore intertwines (see Gelfand, 
Graev and Vilenkin [1966]) the representations on Q 2 on L 2 (IR S ) with 
the one on TL U . 

We conclude with some general remarks. 
1. Since the e^'s are spherical and therefore invariant under SO(n) 
(which is, after all, why they describe spinless particles), they can be 
parametrized by the homogeneous space W = Qi/SO(n) as long as we 
keep u fixed {u is a parameter associated with the Hamiltonian, which 
is a generator of Q 2 but not of Qi). The action of W as a subgroup 
of Q\ on the e^'s is preserved in passing to the homogeneous space, 
hence W acts to translate these vectors in phase space. This explains 
the similarity between the e^'s and the canonical coherent states. On 
the other hand, dynamics (in imaginary time) is responsible for the 
parameter u. If we write k = (m/u)y, then 

e u z (p) 



The measure d/j, u (z) is now 

d// u (x,k) = (u/nm) s/2 exp(-uk 2 /m) d s xd s k. (32) 

Hence the exponential factor exp[wk 2 /2m] in e™, when squared in the 
reconstruction formula, precisely cancels the Gaussian weight factor 
in d/i u (x, k), leaving the measure 

(ti/™) s/2 d s xA (33) 

in phase space. It follows from the above form of that 2Ap fe = 
^/2m/u plays the role of a scale factor in momentum space (as used 
in the wavelet transforms of chapter 1), hence its reciprocal Ax fe = 

\Juj1m acts as a scale factor in configuration space. Thus the Gali- 
lean coherent states combine the properties of rigid "windows" with 
those of wavelets, due to the fact that their analytic semigroup Q 2 in- 
cludes both phase-space translations and scaling, the latter due to the 



u 



= (27r)- s exp 
= exp[uk 2 /2m]e- ip - x h 



u 



( P-k 



(31) 



u 
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heat operator e~ . However, note that u is constant, though arbi- 
trary, in the resolution of unity and the corresponding reconstruction 
formula. Since there is an abundance of "wavelets" due to transla- 
tions in phase space, only a single scale is needed for reconstruction. 
(One could, of course, include a range of scales by integrating over 
u with a weight function, but this seems unnecessary.) In the treat- 
ment of relativistic particles, u becomes the time component of a 
four-vector y = (it, y), hence will no longer be constant. This is be- 
cause relativistic windows shrink in the direction of motion, due to 
Lorentz contractions, thus automatically adjusting to the analysis of 
high-frequency components of the spectrum. 

2. Notice that is essentially the heat operator e~ uH applied to the 
5-function at x, then analytically continued to z = x + iy. The fact 
that all the e^'s have minimal uncertainties shows that the action 
of the heat semigroup {U(—iu)} is such that while the position un- 
dergoes the normal diffusion, the momentum undergoes the opposite 
process of refinement, in just such a way that the product of the two 
variances remains constant. This is reflected in the fact that the op- 
erator e~ uH , whose inverse in L 2 (1R S ) is unbounded, becomes unitary 
when the functions in its range get analytically continued, and the 
reconstruction formula is just a way of inverting e~ uH . Hence no in- 
formation is lost if one looks in phase space rather than configuration 
space! It seems to me that this way of "inverting" semigroups must 
be an example of a general process. If such a process exists, I am 
unaware of it. In our case, at least, it appears to be possible because 
of analyticity. 

3. So far, it seems that coherent-state representations are intimately 
connected with groups and their representations. However, there is a 
reasonable chance that coherent-state representations similar to the 
above can be constructed for systems which, unlike free particles, do 
not possess a great deal of symmetry. Suppose we are given a system 
of s/3 particles in IR 3 which interact with one another and/or with 
an external source through a potential V(x). We assume that V(x) 
is time-independent, so the system is conservative. (This means that 
we do have one symmetry, namely under time translations. If, more- 
over, the potential depends only on the differences Xj — Xj between 
individual particles, we also have symmetry with respect to transla- 
tions of the center of mass of the entire system; but we do not make 
this assumption here.) This system is then described by a Schrodinger 
equation with the Hamiltonian operator H = Hq + V, where Hq is the 
free Hamiltonian and V is the operator of multiplication by V(x). We 
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need to assume that this (unbounded) operator can be extended to a 
self-adjoint operator on L 2 (IR S ). How far can the above construction 
be carried in this case? The key to our method was the positivity of 
the free Hamiltonian H = P 2 /2m. But a general Hamiltonian must 
at least satisfy the stability condition: 

(S) The spectrum of H is bounded below. 
If H fails to meet this condition, then the system it describes is un- 
stable, and a small perturbation can make it cascade down, giving 
off an infinite amount of energy. For a stable system, the evolution 
group U(t) = e~ ltH can be analytically continued to an analytic semi- 
group U(t) = e~ ll ~ H in the lower-half complex time plane as in the 
free case. Depending on the strength of the potential, the functions 
fu = U{—iu)f may be continued to some subset of (E s . Formally, this 
corresponds to defining 



for an initial function /(x) in L 2 (IR S ). As mentioned, this expression 
is formal since the operator e y P is unbounded and e~ ll ~ H f may not 
be in its domain. But it can make sense operating on the range of 
e~ irH , which coincides with the range of e~ uH , provided y is not too 
large. Let y u be the set of all y's for which e y P is defined on the 
range of e~ uH and, furthermore, the function exp [iz • P] exp [— irH\ f 
is sufficiently regular to be evaluated at the origin in 1R S , no matter 
which initial / was chosen in L 2 (JR S ). For many potentials, of course, 
y u will consist of the origin alone; in that case there are no coherent 
states. We assume that y u contains at least some open neighborhood 
of the origin. Intuitively, we may think of y u as the set of all imagi- 
nary positions which can be attained by the particle in an imaginary 
time-interval u, while moving in the potential V. In the free case, 
y u = IR S and there is no restriction on y provided only that u > 0. 
This corresponds to the fact that there is no "speed limit" for free 
non-relativistic particles, hence a particle can get to any imaginary 
position in a given positive imaginary time. For relativistic free parti- 
cles, y u is the open sphere of radius uc, where c is the speed of light. 
Returning to our system of interacting particles, define the associated 
complex space-time domain 



f(*,T)=( 




7)(0), T=t-i U 

f) (x) 



(34) 



Z H = { (x - iy, t - iu) e (E 



u > 0, y G y u }• 



(35) 
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This is the set of all complex space-time points which can be reached 
by the system in the presence of the potential V^(x), and it is the label 
space for our prospective coherent states. These are now defined as 
evaluation maps on the space of analytically continued solutions: 

/(z,r) = (36) 

the inner product being in L 2 (IR S ). Then from the above expression, 
again formally, we have the dynamical coherent states 

„H At H „ — iz-P e /o'7\ 

e Z)T = e e d (37) 

for (z, r) in Zh- 

What is still missing, of course, is the measure dfi 1 ^ . (Since the po- 
tential is t-independent, so will be the measure, if it exists.) Finding 
the measure promises to be equally difficult to finding the propaga- 
tor for the dynamics. The latter is closely related to the reproducing 
kernel, 

i^(z,r;z',f') = (< T |e^ T ,>. (38) 

Kh depends on r and f' only through the difference r — f', and is 
the analytic continuation of the propagator to the domain Zh x Zh- 
It is related to the measure through the reproducing property, 

f d^(z)K H (z',T';z,f)K H (z,T;z",f") 
J <j t (39) 

= ^(z'y ; z",f"), 



where the integration is carried out over a "phase space" o T in Zh 
with a fixed value of r = t — iu. A reasonable candidate for d^ (see 
section 4.4) is 

dtf(z) = C\\e* T \\- 2 d s xd s y 

I T II 4Q 

Rather than finding the measure explicitly, a more likely possibil- 
ity is that its existence can be proved by functional-analytic methods 
for some classes of potentials and approximation techniques may be 
used to estimate it or at least derive some of its properties. The theo- 
retical possibility that such a measure exists raises the prospect of an 
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interesting analogy between the quantum mechanics of a single sys- 
tem and a statistical ensemble of corresponding classical systems at 
equilibrium with a heat reservoir. In the case of a free particle, if we 
set k = (m/u)y as above (see remark 1) and define T by u = l/2kT 
where k is Boltzmann's constant, then it so happens that our mea- 
sure d\i u is identical to the Gibbs measure for a classical canonical 
ensemble (see Thirring [1980]) of s/3 free particles of mass m in IR 3 , 
at equilibrium with a heat reservoir at absolute temperature T. Thus, 
integrating with dfx u over phase space is very much like taking the clas- 
sical thermodynamic average at equilibrium! It remains to be seen, of 
course, whether this is a mere coincidence or if it has a generalization 
to interacting systems. 

There is also a connection between the expectation values of an 
operator A in the coherent states e^ T and its thermal average in the 
Gibbs state, 




where Z 



Trace (e /3H ) . Namely, if we have the resolution of unity 




(42) 



then 



Z 



Z 




PH/2 A e -f3H/2 e H 

Z,T 



Ae: 



H 

z,r-ip/2 



) 



) 



(43) 




where we have used the formula 




(44) 



which follows easily from eq. (42). Thus taking the thermal average 
means shifting the imaginary part u by (3/2 in the integral. 



110 



4. Complex Spacetime 



4.4. Relativistic Frames 

We are at last ready to embark on the main theme of this book: A 
new synthesis of Relativity and quantum mechanics through the ge- 
ometry of complex spacetime. The main tool for this synthesis will 
be the physically necessary condition that the energy operator of the 
total system be non-negative, also known in quantum field theory as 
the spectral condition. The (unique) relativistically covariant state- 
ment of this condition gives rise to a canonical complexification of 
spacetime which embodies in its geometry the structure of quantum 
mechanics as well as that of Special Relativity. The complex space- 
time also has the structure of a classical phase space underlying the 
quantum system under consideration. Quantum physics is developed 
through the construction of frames labeled by the complex spacetime 
manifold, which thus forms a natural bridge between the classical and 
quantum aspects of the system. It is hoped that this marriage, once 
fully developed, will survive the transition from Special to General 
Relativity. 

As mentioned at the beginning of this chapter, the Perelomov- 
type constructions of chapter 3 do not apply directly to the Poincare 
group since its time evolution (dynamics) is non-trivial. Pending a 
generalization of these methods to dynamical groups, we merely use 
the ideas of chapter 3 for inspiration rather than substance. In fact, 
it may well be that a closer examination of the construction to be 
developed here may suggest such a generalization. 

We begin with the most basic object of relativistic quantum me- 
chanics, the Klein-Gordon equation, which describes a simple rela- 
tivistic particle in the same way that the Schrodinger equation de- 
scribes a non-relativistic particle. The spectral condition will enable 
us to analytically continue the solutions of this equation to complex 
spacetime, and the evaluation maps on the space of these analytic 
solutions will be bounded linear functionals, giving rise to a repro- 
ducing kernel as in section 1.4. Physically, the evaluation maps are 
optimal wave packets, or coherent states, and it is this interpretation 
which establishes the underlying complex manifold as an extension 
of classical phase space. The next step is to build frames of such 
coherent states. (Recall from section 1.4 that a frame determines a 
reproducing kernel, but not vice versa.) 

The coherent states we are about to construct are covariant un- 
der the restricted Poincare group, hence they represent relativistic 
wave packets . As we have seen, such a covariant family is closely re- 
lated to a unitary irreducible representation of the appropriate group, 
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in this case Vq. Such representations are called elementary systems, 
and correspond roughly to the classical notion of particles, though 
with a definite quantum flavor. (For example, physical considerations 
prohibit them from being localized at a point in space, as will be 
discussed later.) We will focus on representations corresponding to 
massive particles. (A phase-space formalism for massless particles 
would be of great interest, but to my knowledge, no satisfactory for- 
mulation exists as yet.) Such representations are characterized by two 
parameters, the mass m > and the spin j = 0, 1/2, 1, 3/2, ... of the 
corresponding particle. We will specialize to spinless particles (j = 0) 
for simplicity. The extension of our construction to particles with spin 
is not difficult and will be taken up later. Thus we are interested in 
the (unique, up to equivalence) representation of Vq with m > and 
j = 0. A natural way to construct this representation is to consider 
the space of solutions of the Klein-Gordon equation 

(□ +mV)/(x) = 0, (1) 

where 

2<9 2 A 

is the Del'Ambertian, or wave operator, A is the usual spatial Lapla- 
cian and = d/dx^. The function / is to be complex-valued (for 
spin j, f is valued in G 2j+1 ). We set c = 1 except as needed for future 
reference. If we write f(x) as a Fourier transform, 

f{x) = (2tt)- s - 1 / d s+1 p e-* 3 * RP), (3) 

then the Klein-Gordon equation requires that f(p) be a distribution 
supported on the mass shell 

V m = {p = Cpo, P) e M s+1 \p 2 =pl-p 2 = m 2 }. (4) 
fi m is a two-sheeted hyperboloid, 

O m = 0+ U SI- , (5) 

where 



(2) 
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po = iv^ + p 2 = ±w(p) on 0± . (6) 

Taking 

/(p) =27r5(p 2 -m 2 )a(p) (7) 
for some function a(p) on fi m , and using 

<5(p 2 -m 2 ) = 5 ((po -w)(po + u>)) 

1 (8) 
= t^j [<5(po - + 5(Po + w)J , 



we get 



f{x) = I dpe- txp a(p) 
J ci m 

= I dp [e~ ixp a(p) + e ixp a(-p)] 
Jut. 



(9) 



where 



dp= (2n)- s (2uj)- L d s p (10) 

is the unique (up to a constant factor) Lorentz-invariant measure 
on Q m . (The factor u~ l corrects for Lorentz contraction in frames at 
momentum p.) For physical particles, we must require that the energy 
be positive, i.e. that a{p) = on f2~ . Hence the physical states are 
given as positive-energy solutions, 



f(x) = ( dpe- lxp a{p). (11) 



The function a(p) can now be related to the initial data by setting 
x° = t = 0, which shows that 

/ o (x) = /(x,0)= f dpe™»a(p) 

Jfi+ (12) 

= ((2^)- 1 a)'(x), 

so 

a(p) = a(u, p) = 2w/o(p), (13) 
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where " denotes the spatial Fourier transform. In particular, f(x) is 
determined by its values on the Cauchy surface t = 0. For general 
solutions of the Klein-Gordon equation, we would also need to specify 
df/dt on that surface, but restricting ourselves to positive-energy 
solutions means that f(x) actually satisfies the first-order pseudo- 
differential non-local equation 

i^ t =Vm*-Af(x) (14) 

(which implies the Klein-Gordon equation), hence only /(x, 0) is nec- 
essary to determine /. (We will see that when analytically continued 
to complex spacetime, positive-energy solutions have a local charac- 
terization.) The inner product on the space of positive-energy so- 
lutions is defined using the Poincare-invariant norm in momentum 
space, 

||/|| 2 =/ dp\a(p)\ 2 . (15) 

We will refer to the Hilbert space 

L\{dp) = {ae L 2 (dp) | a(p) = on O" } (16) 

as the space of positive-energy solutions in the momentum represen- 
tation. It carries a unitary irreducible representation of Vq defined as 
follows. The natural action of Vq on spacetime is 

(6,A)x = Ax + 6, (17) 

where A is a resticted Lorentz transformation (A G Cq ) and b is a 
spacetime translation. Since the Klein-Gordon equation is invariant 
under Vq, the induced action on functions over spacetime transforms 
solutions to solutions. Since the positivity of the energy is also invari- 
ant under Vq, the subspace of positive-energy solutions is also left 
invariant. Vq acts on solutions by 

(U(b,A)f)(x) = f(A- 1 (x-b)). (18) 

The invariance of the inner product on L 2 Sr {dp) then implies that the 
induced action on that space (which we denote by the same operator) 
is 



(*/(&, A) a) (p) = e lbp a(A- 1 p) 



(19) 
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The invariance of the measure dp then shows that U(b, A) is unitary, 
thus (6, A) i— > U(b,A) is a unitary representation of Vq. It can be 
shown that it is, furthermore, irreducible. 

Neither of the "function" spaces {f(x)} and L\{dp) are repro- 
ducing-kernel Hilbert spaces, since the evaluation maps / t— > f(x) and 
a m a(p) are unbounded. To obtain a space with bounded evaluation 
maps, we proceed as in the last section. Due to the positivity of the 
energy, solutions can be continued analytically to the lower-half time 
plane: 



/(x, t — iu) = dp exp (—ituj — uu + zx • p) a(p), (20) 
Jn+ 

where u > 0. As in the non-relativistic case, the factor exp(— uu) 
decays rapidly as | p | — > oo, which permits an analytic continuation 
of the solution to complex spatial coordinates z = x — iy. But since 

uj(p) = \Zm 2 + p 2 is no longer quadratic in | p | , y cannot be ar- 
bitrarily large. Rather, we must require that the four-vector (tt, y) 
satisfy the condition 

uu;-y.p>0 V( W ,p)eQ+. (21) 
In covariant notation, setting y° = -u, we must have 

yp > Vp G 0+ , (22) 

so that the complex exponential exp [— i(x — iy)p] remains bounded 
as p varies over This implies that yp > for all p e V + , where 

V+ = {p= (po, P ) G R s+1 | |p| < po/c} (23) 

is the open forward light cone in momentum space. In general, we 
need to consider the closure of V+, i.e. the cone 

V+ = {pe M s+1 | |p| < po/c}, (24) 

which contains the light cone {p 2 = | po > 0} (corresponding to 
massless particles) and the point {p = 0} (corresponding in quantum 
field theory to the vacuum state). The set of all y's with yp > is 
called the dual cone of V+, i.e., 

V^ = {ye IR S+1 1 yp > Vp G F+}. (25) 
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It is easily seen that y belongs to V\_ if and only if | y | < cy°. Note 
that as c — > oo, V + contracts to the non-negative po~axis while V\_ 
expands to the half-space {(u,y) \ u > 0, y G IR S } which we have 
encountered in the last section. V[_ coincides with V + when c = 1, 
but it is important to distinguish between them since they "live" in 
different spaces (see section 1.1). 

Thus for y G V+, setting z = x — iy, we define 

f( z ) = I dp e~ izp a(p). (26) 
Jn+ 

The integral converges absolutely for any a G L 2 + {dp) and defines a 
function on the forward tube 

T + = { x -iye(D 8+1 \yeVi}, (27) 

also known as the future tube and, in the mathematical literature, as 
the tube over V\_. Differentiation with respect to z M under the integral 
sign leaves the integral absolutely convergent, hence the function f(z) 
is holomorphic, or analytic, in 7+. As y — ■> in V+, f(z) — > f(x) in the 
sense of L+(dp). Thus f(x) is a boundary value of /(^). Clearly /(z) 
is a solution of the Klein-Gordon equation in either of the variables 
z or x. Let /C be the space of all such holomorphic solutions: 

K = {f{z)\aeL\{dp)}. (28) 

Then the map a(p) i— > /(z) is one-to-one from L^.(cZp) onto /C. Hence 
we can make /C into a Hilbert space by defining 



(/i|/ 2 )/c = <ai|a 2 ), (29) 

where the inner product on the right-hand side is understood to be 
that of L+(dp). We now show that /C is a reproducing-kernel Hilbert 
space. Its evaluation maps are given by 

E z {f) = f{z)= [ dpe-^a(p) = (e z \a), (30) 

where 



e z (p) = e^. 



(31) 
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Lemma 4.2. 

1. For each z G T + , e z belongs to L 2 + (dp), with 



= (2n)~ 1 ^y K v (2Xmc), (32) 



where v = (s — l)/2, 



A=v / ^=v / W 7 ?>0 



(33) 



and -ftT^ is a modified Bessel function (Abramowitz and Stegun 
[1964]; the speed of light has been inserted for future reference.) 
2. In particular, the evaluation maps on JC are bounded, with 



\m \ <\\e z \\\\f\\. (34) 
Proof. Set c = 1. (To recover c, replace m by mc in the end.) Then 

\\e z f= [ dpe- 2 yP^G(y). (35) 

Since G(y) is Lorentz-invariant and y G V+, we can evaluate the 
integral in a Lorentz frame in which y = (A, 0): 



G(y) = G(X,0) 
= (2tt)- 
Set p = mq. Then 



dp e 



2^Jm 2 + p 2 



exp 



-2A v / m 2 + p 2 



s^s — l 



d s q 



2^1+^ 



exp 



-2Xmy/l + q 2 



= (2yr)- s m 



exp 



-2\m\/\ + r 2 



m 



T(s/2)J v^TT^ 

s—l roo 

I dtsmh s ~ 1 t exp [—2 Am cosh t] 
Jo 



(4tv) s / 2 T(s/2) 
= (2.)- (^)V(2Am) 



(36) 



(37) 
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The reproducing kernel can be obtained by analytic continuation 
from 11 ■ 



J Z 



K(z',z) = (e z > | e z ) 

= dp exp [—i(z' — z)p] 

= (27r)_1 {^n) K ^™c), 



(38) 



where 



rj = \/-{z' - z) 2 



[{y' + y) 2 - (x' - x) 2 + 2i(y' + y)(x' - x)] 



1/2 



(39) 



is defined by analytic continuation from z' = z (when r\ = 2A) as 
follows: The square-root function is defined on the complex plane cut 
along the negative real axis. Since y and y' both belong to V+, so 
does y' + y. Now the argument of the square root is real if and only 
if (y' + y)(x' — x) = 0, and this can happen only when (x' — x) 2 < 0. 
(Otherwise, either x' —x or x — x' belongs to V\_, hence (y' + y)(x' — x) 
is positive or negative, respectively.) But then, 

~(z' - z) 2 = (y' + y) 2 - (x' - x) 2 > (y' + y) 2 > 0. (40) 

Thus for z',z G T + , the quantity —{z! — z) 2 cannot belong to the 
negative real axis, so rj is well-defined. 

The reproducing kernel is closely related to the analytically con- 
tinued (Wightman) 2-point function for the scalar quantum field of 
mass m (Streater and Wightman [1964]): 

K{z',z) = -iA+(z' -z). (41) 

We will encounter this and other 2-point functions again in the next 
chapter, in connection with quantum field theory. 
Note: We will be interested in the behavior of || e z \\ near the bound- 
ary of 7+, i.e. when A ~ 0. From the properties of K v it follows 
that 

H e * II 2 ~ ttSft x ~ 2u when A ~ °- ( 42 ) 
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In particular, the evaluation maps are no longer bounded when A = 0. 

Vq acts on T + by a complex extension of its action on real space- 
time, i.e., 

z = (b, A) z = Az + b. (43) 

This means that x' = Ax+b as before, and y' = Ay. (This is consistent 
with the phase-space interpretation of T + to be established below.) 
The induced action on /C is therefore 

(U(b,A)f)(z) = f{A- 1 (z-b)). (44) 

This implies that the wave packets e z transform covariantly under Vq, 
i.e. 

U(b,A)e z = e Az+b . (45) 

We have now established that the space /C of holomorphic posi- 
tive-energy solutions is a reproducing-kernel Hilbert space. Recall 
that picking out the positive-energy part of f(x) was a non-local op- 
eration in real spacetime, involving the pseudodifferential operator 
\Jm? — A. However, when extended to 7+, such functions may be 
characterized locally, as simultaneaous solutions of the Klein-Gordon 
equation and the Cauchy-Riemann equations, since the negative- 
energy part of f{x) does not have an analytic continuation to T + . 

We now show that 7+ may, in fact, be interpreted as an extended 
phase space for the underlying classical relativistic particles. Clearly, 
x M = 3? z M are the spacetime coordinates. Their relation to the expec- 
tation values of the relativistic (Newton-Wigner) position operators 
will be discussed below. We now wish to investigate the relation 
of the imaginary coordinates y M = —3 z M to the energy-momentum 
vector. The bridge between the "classical" coordinates y M and the 
quantum-mechanical observables will be, as usual, the (future) 
coherent states e z . Before getting involved in computations, let us 
take a closer look at these wave packets in order to get a qualita- 
tive picture. Since yp is Lorentz-invariant, it can be evaluated in a 
reference frame where p = (mc 2 , 0). Thus 

yp = y°mc 2 = a/A 2 + y 2 mc > Amc, (46) 

with equality if and only if y = 0, i.e. when y and p are parallel. This 
is a kind of reverse Schwarz inequality which holds in V+ x V+ under 
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the pairing provided by the Lorentzian scalar product. Thus for fixed 
y 6K[ and variable p G 0+ , we have 



\e z (p)\ <e" Amc , (47) 

the maximum being attained when and only when p = (mc/X)y = p y . 
Hence we expect, roughly, that 



W= , i V ~{rnc/\)y». (48) 



Therefore the vector y, while itself not an energy-momentum, acts as 
a control vector for the energy-momentum by filtering out p's which 
are "far" from p y . The larger we take the parameter A, the finer the 
filter. The expected energy-momentum can be computed exactly by 
noting that 

(e*|P„e,> = / dpp^e- 2 '" 

_ %jm (49) 

2 dy^ 

where G(y) = \\e z \\ 2 as before. Since G depends on y only through 
the invariant quantity A, we have 

' 2 dV " dX (50) 
K u+ i(2\mc) mc\ 

K u (2Xmc) rj Vfl ' 

where we have used the recurrence relation (Abramowitz and Stegun 
[1964]) 

Q 

-— (\~" K v {2\m)) = 2m\- u K v+1 {2\m). (51) 

OA 

This verifies and corrects the above qualitative estimate. In view of 
the above relation, the hyperboloid 

n+ = { y eV^\y 2 = \ 2 } (52) 
corresponds to the mass shell. Hence the submanifold 
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C x = {x-iyeT+\y 2 = A 2 } (53) 

corresponds to the extended phase space C defined in section 4.2 
which, we recall, was a homogeneous space of Vq that was interpreted 
as spacetime x velocity space. Let us define the effective mass m\ of 
the particle on by 

(m A c) 2 = (P M )(P^) = (P) 2 . (54) 

Then 



m,=m- (55) 

and 

{P,) = v ^fy,- (56) 

We claim that m\ > m, which can be seen as follows. For all p,p' G 
0+ we have the "reverse Schwarz inequality" pp' > m 2 , with equality 
if and only if p = p' . Hence 

ml = (Pf 

= G~ 2 J J dp dp pp' exp(—2yp — 2yp) (57) 
> m 2 . 

This is a kind of "renormalization effect" due to the uncertainty, or 
fluctuation, of the energy-momentum in the state e z . It appears to go 
in the "wrong direction" (i.e., (P) 2 > (P 2 )) for the same reason as 
does the inequality pp' > m 2 , namely because of the Lorentz metric. 

Thus (Pfj,) is proportional, by a y-dependent but Po^mfariarrt 
factor, to y M . We may therefore consider the y^s as homogeneous 
coodinates for the direction of motion of the classical particle in (real) 
spacetime. Alternatively, the expectation of the velocity operator 
P/Po can be shown to be y/yo- Thus of the s + 1 coordinates y M , 
only s have a "classical" interpretation. It is important to understand 
that the parameter A has no relation to the mass; it can be chosen to 
be an arbitrary positive number and has the physical dimensions of 
length. It is the relativistic counterpart of the parameter u encoun- 
tered in connection with the non-relativistic coherent states, and its 
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significance will be studied later. At this point we simply note that A 
measures the invariant distance of z from the boundary of T + . The 
larger A, the more smeared out are the spatial features and the more 
refined are the features in momentum space. (Recall that the imag- 
inary part u of the time played a similar role in the non-relativistic 
theory.) 

Because the vector y is so fundamental to our approach, it deserves 
a name of its own. We will call it the temper vector. The name is 
motivated in part by the smoothing effect which y has on spacetime 
quantities, and also by the fact that y plays a role similar to that 
played by the inverse teperature (3 = 1/kT in statistical mechanics; 
the latter controls the energy. 

From the asymptotic properties of the K u 's it follows that 




This can be understood as follows: When Amc — > oo, e.g. c — > oo 
for fixed Am, we recover the non-relativistic results. For example, 
the expectations of the spatial momenta approach those in the non- 
relativistic coherent states: 

(P k ) ~ (mc/X)y k = (m/u)y k . (59) 

When Amc — > 0, say A — > for fixed mc, then z approaches the bound- 
ary of 7+. In that case, fluctuations take over and the expectations 
become independent of the mass m. 

The relation of the spacetime parameters x M = 3?z M to the 
Newton-Wigner position operators is as follows. Since a fixed / G T + 
describes the entire history of the particle, the associated state does 
not change with time (i.e., the dynamics is already built in). This 
means that we are in the so-called Heisenberg picture, and time- 
behavior must be described by evolving the observables A: 

A(t) = e ltp ° A(0) e~ ltp ° . (60) 

The Newton-Wigner position operators are uniquely determined by a 
set of seemingly reasonable assumptions concerning the localizability 
of the particle (Newton and Wigner [1949], Wightman [1962]), and 
are given in the momentum representation at time xq = by 
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dpk ^ 

=i {i- k -B)> k s - 

Now choose z = x — iy £ T + with xq = 0. Then 



(e z \X k (0) e ; 



)= I dp u 1/2 e~ lzp i^- L- 1/2 e l ~ zp ] 
Jn+ dp k V J 

= ft [ dpu^e-^i^-iuj-^e^) 
Jn+ dp k V / 



(62) 



hence 



(X k (0)) = x k . (63) 

It must be noted, however, that the concept of position for relativistic 
particles is highly unsatisfactory. Not only are the position operators 
non-covariant (this would seem to require a time operator on equal 
footing with them, which would exclude the possibility of dynamics); 
but the very concept of localizability for such particles is fraught with 
difficulties. For example, eigenvectors of the Newton-Wigner position 
operators, known as "localized states," spread out from a single point 
at time xq to fill the entire universe an arbitrarily small time later, 
violating relativistic causality. (The same phenomenon in the non- 
relativistic theory presents no conceptual problem, since propagation 
velocities are unrestricted there.) Even much weaker notions of local- 
ization give rise to problems with causality (Hegerfeldt [1985]). In my 
opinion, it is best to admit that position is simply a non-relativistic 
concept, and in the relativistic theory events x should be regarded as 
mere parameters of the spacetime manifold. As such, our formalism 
extends them to z = x — iy, with the new parameters y playing the 
role of a control vector for the energy-momentum observables. Thus, 
in place of a set of pairs of canonically conjugate observables X k , P k , 
we have a set of observables _P M and a dual set of complex parameters 
z^. The symmetry between position- and momentum operators in 
the non-relativistic theory was based on the Weyl-Heisenberg group, 
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and we have seen that this symmetry is "accidental," being broken in 
the transition to relativity. 

Note: This further reinforces the idea discussed in section 4.2, that 
"group-theoretic quantization," i.e. the quantization of classical sys- 
tems obtained by requiring states and observables to transform under 
unitary representations of the associated dynamical groups or cen- 
tral extensions thereof, is superior to "canonical quantization" (send- 
ing the classical phase-space observables to operators, with Poisson 
brackets going to commutators). # 

Let us now consider the uncertainties in the energy- and momen- 
tum operators. More generally, we compute the correlation matrix 

C^ = (P^)-(P M )(P,), (64) 
of which the uncertainties Ap^ are the diagonal elements. From 

1 F) 2 C 

(e x \P*P v e z ) = - „ Z 7 (65) 



it follows that 



d 2 lnG 



dy»dy» dy^ dy v ^ 



dy^dy v 



(Incidentally, this shows that the function InG is of some interest in 
itself: its first partials are the expected momenta, while its second 
partials are the correlations.) A computation similar to that for ( P M ) 
gives 

_ m 2 / K v+2 {2\m) _ m|\ _ mx 

" V*V» A 2 ^ Kv (2\m) m 2 J ^ 2A ' 1 J 

Although this expression does not appear to be enlightening in any 
obvious way, the uncertainties A can be estimated from it in various 
limits such as Am — > oo and Am — > 0. 

The reproducing kernel by itself is of limited use. Although it 
makes it possible to establish the interpretation of 7+ as an extended 
classical phase space, it does not provide us with a direct physical 
interpretation of the function values f(z). The inner product in /C is 
borrowed from L+(dp), hence a probability interpretation exists, so 
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far, only in momentum space. In the standard formulation of Klein- 
Gordon theory, it is possible to define the inner product in configu- 
ration space, but the corresponding density turns out not to be posi- 
tive, thus precluding a probabilistic interpretation. This is one of the 
well-known difficulties with the first-quantized Klein-Gordon theory, 
and is one of the reasons cited for the necessity to go to quantum 
field theory (second quantization). We will see that the phase-space 
approach does admit a covariant probabilistic picture of relativistic 
quantum mechanics, thus making the theory more complete even be- 
fore second quantization. These topics will be discussed further in 
the next section and the next chapter. At this point we wish only 
to define an "autonomous" inner product on /C as an integral over a 
"phase space" lying in 7+. This will provide us with a normal frame 
of evaluation maps (chapter 1). 

Recall that in the non-relativistic theory of the last section, the 
norm in the space 7i u of holomorphic solutions was obtained by inte- 
grating | /(z, t) I 2 with respect to the Gaussian measure dfi u (z) over 
the phase space r = t — iu = constant. But now, for given y° = u, 
only those y's with | y | < cu belong to V[. That is, the particle can 
only travel a finite imaginary distance in a finite imaginary time. In 
view of the relation (P^) oc y M , the obvious candidate for a phase 
space is the set defined for given t G IR and A > by 

a = <r t ,x = {x-iyeT+ \ x° = t, y 2 = A 2 }. (68) 

Such sets are not covariant, but a covariant extension will be found in 
the next section. As for the measure, a Gaussian weight function (such 
as exp(— my 2 /ti), which occured in dji u {y)) is no longer satisfactory 
since it cannot be covariant. It turns out that we do not need a 
weight function at all! This can be seen as follows: In the non- 
relativistic case, the shift to complex time was performed once and 
for all by the operator e~ uH . For fixed u > 0, the weight function 
served to correct for the non-unitary translation from the real point 
x in space to the complex point z = x — iy. However, if we restrict 
ourselves to the subset cr, then a translation to imaginary space is 
necessarily accompanied by a translation in imaginary time. The 

analog of the above translation is (t — iX, x) i— > (t — i a/A 2 + y 2 , x — iy) . 
The increase in y , it turns out, precisely compensates for the shift 
to complex space! This follows from the fact that the operator e~ yP , 
which affects the total shift to complex spacetime, is relativistically 
invariant, hence the point y = no longer plays a special role. We 
will show later that in the non-relativistic limit, we recover the weight 
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function naturally. Hence the Gaussian weight function associated 
with the Galilean coherent states (which, as we have seen, is closely 
related to that associated with the canonical coherent states) has its 
origin in the geometry of the relativistic phase space, i.e. in the 
curvature of the hyperboloid {y 2 = A 2 }. 
For a = a t ,\ as above and / G /C, define 

2 = [ da\f(z)\ 2 , (69) 

where we parametrize a by (x, y) G M 2s and the measure da is given 
by 



da(z) = A^efxefy (70) 



with 



mc J 
2-7rA\ s m\ 
mc J m 

Then we have the fo lowing result. 
Theorem 4.3. For all f G K,, 



( 7rA A 

A\ = 7T -1 ( — ) K u+1 (2\mc) 



G{\). 



(71) 



(72) 



Proof. Assume, to begin with, that a(p) = a(u>, p) belongs to the 
Schwartz space 5(IR S ) of rapidly decreasing test functions. Then 

f{z) = (2tt)- s J d s p {2u)~ l exp(-ixp - yp) a(p) 

= [(2w) _1 exp(— itu — yp) a] ~(x). 
Hence, by Plancherel's theorem, 



/ rf s x \f( x -iy)\ 2 = (2n)~ s [ d s p {Au^e'^ \ a(p) \ 2 . 

Jx°=t JTR S 

(74) 

Exchanging the order of integration in the integral representing 



2 , we obtain 
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2 = A- X \2n)~ s J d a p (4a; 2 )- 1 1 a(p) | 2 J d 8 ye 



2yp 



(75) 



We now evaluate 



J(p) = J d s ye~ 2 ™ 



(76) 



as follows: Consider all s + 1 components of p as independent and 
define m(p) = a/p 2 - From the integral computed earlier, i.e. 



(77) 



G(y) = (2tt)- s J d s p{2u)- 1 e- 2 ^ 
= (27r)- 1 (£^)^,(2Am), 
we obtain by exchanging p and y (as well as m and A): 

J d s y (2y )- 1 e- 2 ^ = " tf„(2Aro). (78) 

Taking the partial derivative with respect to p° on both sides gives 

d 



J(p) = 



dp 



d 



dp [ 

" (2yrA } ay d£ 



(79) 



[r"*,(0] 



where £(p) = 2Xm(p). Using again the recurrence relation for the 
-ftVs, we get 



J(p) = 2p A x . 



Thus 



= (2tt)- s J d s p(2u)~ 1 \a(p) | 2 = 



(80) 



(81) 



4.4. Relativistic Frames 



127 



for a(p) G S(IR S ). By continuity, this extends to all a G L 2 + {dp) since 
S{M S ) is dense in L\(dp). I 
If we define 



J <j 



(82) 



then by polarization we have the following immediate consequence of 
the theorem. 

Corollary 4.4. For all fi,f 2 G /C, 

(/l|/2>a=(/l|/2>)C (83) 

In particular, ( • | • ) a defines a Vo-invariant inner product on JC, and 
we have the resolution of unity 

[ da x (z) \ e z )(e z \ = J, (84) 

J a 

making the vectors e z with z G a a normal frame. I 

Note: The above results extend to the case A = by continuity. The 

normalization constant in the measure da is then given by 

A = I^A X = ^(» + V (85) 
U xio A 2(mc) s+1 K ' 

The same formula applies for fixed A > and m ~ 0, and shows that 
A\ becomes unbounded as m — > 0. # 

The vectors e z belong to L^_(dp), but correspond to vectors e z in 
/C defined by 

e z (z') = (e z > \e z ) = K(z' - z). (86) 

The norm ||/||^ = ( / | f) a on K provides us with an interpretation 
of | f(z) | 2 as a probability density with respect to the measure c&ta 
on the phase space a. Within this interpretation, the wave packets e z 
have the following optimality property: For fixed z G 7+ let 

e z {z') = (87) 

ll e 2|l 

Proposition 4.5. Up to a constant phase factor, the function e z is 
the unique solution to the following variational problem: Find f G /C 
such that ||/|| = 1 and | f(z) \ is a maximum. 
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Proof. This follows at once from the Schwarz inequality and theorem 
4.3, since by eq. (26), 

\f(z) | = | (e z \a) | = | (e z \f) \ 
<P.|| H/ll = \\e z \\ H/ll, 

with equality if and only if / is a constant multiple of e z . I 
According to our probability interpretation of | f(z) | 2 , this means 
that the normalized wave packet e z maximizes the probability of find- 
ing the particle at z. 

Note: Unlike the non-relativistic coherent states of the last section, 
the e z 's do not have minimum uncertainty products. In fact, since the 
uncertainty product is not a Lorentz-invariant notion, it is a priori 
impossible to have relativistic coherent states with minimum uncer- 
tainty products. The above optimality, which is invariant, may be 
regarded as a reasonable substitute. Actually, there are better ways 
to measure uncertainty than the standard one used in quantum me- 
chanics, which is just the variance. From a statistical point of view, 
the variance is just the second moment of the probability distribu- 
tion. Perhaps the best definition of uncertainty, which includes all 
moments, is in terms of entropy (Bialynicki-Birula and Mycielski 
[1975], Zakai [I960]). Being necessarily non-linear, however, makes 
this definition less tractable. 



4.5. Geometry and Probability 

The formalism of the last section was based on the phase space a t ,\ = 
a and the measure da, neither of which is invariant under the action of 
Vq on T + . Yet, the resulting inner product ( • | • ) a is clearly invariant. 
It is therefore reasonable to expect that a and da merely represent 
one choice out of many. Our purpose here is to construct a large nat- 
ural class of such phase spaces and associated measures to which our 
previous results can be extended. This class will include a and will 
be invariant under Vq. In this way our formalism is freed from its de- 
pendence on a and becomes manifestly covariant. As a byproduct, we 
find that positive-energy solutions of the Klein-Gordon equation give 
rise to a conserved probability current, so the probabilistic interpre- 
tation becomes entirely compatible with the spacetime geometry. As 
is well-known, no such compatibility is possible in the usual approach 
to Klein-Gordon theory. 
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We begin by regarding T + as an extended phase space (symplectic 
manifold) on which Vq acts by canonical transformations. Candidates 
for phase space are 2s-dimensional symplectic submanifolds uCT + , 
and Vo maps different cr's into one another by canonical transforma- 
tions. A submanifold of the "product" form a = S — where S 
(interpreted as a generalized configuration space) is an s-submanifold 
of (real) spacetime IR S+1 , turns out to be symplectic if and only if S is 
given by x° = t (x) with |Vt| < 1, that is, if and only if S is nowhere 
timelike. (This is slightly larger than the class of all spacelike config- 
uration spaces admissible in the standard theory.) The original a tt \ 
corresponds to t (x) =constant. The results of the last section are 
extended to all such phase spaces of product form. 

The action of Vq on T + is not transitive but leaves each of the 
(2s + l)-dimensional submanifolds 

T+ = {x-iyeT + \y 2 = X 2 } (1) 

invariant. Each is a homogeneous space of Vq, with isotropy 
subgroup SO(s), hence 

T+^V /SO(s). (2) 

Thus corresponds to the homogeneous space C of section 4.2 
(where we had specialized to s = 3). In view of the considerations 
in sections 4.2 and 4.4, each can be interpreted as the product of 
spacetime with "momentum space" . Phase spaces a will be obtained 
by taking slices to eliminate the time variable. 

On the other hand, we also need a covariantly assigned measure 
for each a. The most natural way this can be accomplished is to begin 
with a single Vo-invariant symplectic form on T + and require that its 
restriction to each o be symplectic. This will make each a a symplectic 
manifold (which, in any case, it must be to be interpreted as a classical 
phase space) and thus provide it with a canonical (Liouville) measure. 
Thus we look for the most general 2-form a on T + such that 

(a) a is closed, i.e., da = 0; 

(b) a is non-degenerate, i.e., the (2s + 2)-form a s+1 = a A a A • • ■ A a 
vanishes nowhere; 

(c) for every g = (a, A) e Vq, g*a = a, where g*a denotes the pull- 
back of a under g (see Abraham and Marsden [1978]). Since every 
^"invariant function on T + depends on z only through y 2 , the 
most general invariant 2-form is given by 
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a = (j)(y 2 ) d Vll A dx» + ^{y 2 ) y^dy" A dx v . (3) 

Now the restriction (pullback) of the second term to T A + vanishes, 
since it contains the factor y^dy^ = d(y 2 )/2. Furthermore, the coeffi- 
cient (p(y 2 ) of the first term is constant on T^~. Hence we may confine 
our attention to the form 

a = dy^ A cfe M (4) 

without any essential loss of generality. This form is symplectic as 
well as invariant, hence it fulfills all of the above conditions. T + , 
together with ct, is a symplectic manifold, and invariance means that 
each g EVo maps T + into itself by a canonical transformation. 

A general 2s-dimensional submanifold a of T + will be a potential 
phase space only if the restriction, or pullback, of a to a is a symplectic 
form. We denote this restriction by a a . Let a be given by 

a = {z E T+ | s(z) = h(z) = 0}, (5) 

where s(z) and h(z) are two real-valued, C°° (or at least C 1 ) functions 
on T + such that ds A dh ^ on a. For example, a t; \ can be obtained 
from s(z) = x° — t and h(z) = y 2 — A 2 . The pullback a a depends only 
on the submanifold a, not on the particular choice of s and h. 
Proposition 4.6. The form a a is symplectic if and only if the 
Poisson bracket 

ds dh dh ds 
{$M = d^dy~~dx^dy~^ () 

everywhere on a. 

Proof. a a is closed since a is closed and d(a a ) = (da)^. Hence a a 
is symplectic if and only if it is non-degenerate, i.e. if and only if its 
s-th exterior power a s vanishes nowhere on a. Now (a ff ) s equals the 
pullback of a s to cr, and a straightforward computation gives 



a s = si dy ^ A dx^ (7) 



where 



dy^ = (-l) s dy A dy 1 A • • • A cfy M _i A dy^ +1 A • • • A dy s 
dx„ = (-l) s dx s A dx 3 ' 1 A • • • A dx» +1 A dx^' 1 A • • • A dx°. 



(8) 
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(dy^ and dx^ are essentially the Hodge duals (Warner [1971]) of dy^ 
and dx 11 , respectively, with respect to the Minkowski metric.) 
Let {tti, . . . , u 2s , vi, v 2 } be a basis for the tangent space of T + at z G cr, 
with {tti, . . . , -U2 S } a basis for the tangent space o z of cr. Since (is and 
dh vanish on the vectors Uj, 

(a s Ads A dh)(ui, . . . , u 2s , vi, v 2 ) 

= a s (ui, . . .,u 2s ) (ds A dh)(vi,v 2 ) (9) 
= a£(iti, . . .,v,2s) (ds A dh)(v 1 ,v 2 ). 

By assumption, (ds Adh)(vi,v 2 ) 7^ 0. Therefore a a is non-degenerate 
at z if and only if a s A ds A dh 7^ at z. But by eq. (7), 

a s A ds A dh = si {s, h} dy A dx, (10) 

where 



dy = dyo A • • • A dy s 
dx = dx s A ■ ■ ■ A dx°. 



(11) 



Hence a 8 a 7^ at z if and only if {s, /i} 7^ at z. I 

Let us denote the family of all such symplectic submanifolds a by 

So- 

Proposition 4.7. Let a G E and g eVq. Then go G E and tiie 
restriction g: a — > per is a canonical transformation from (a, a CT ) to 
(per, a ff(T ). 

Proof. Let p* denote the pullback map defined by p, taking forms 
on cycr to forms on a. Then the invar iance of a implies that 

g*a gcr = a a . (12) 

Hence a ga is non-degenerate. It is automatically closed since a is 
closed. Thus go G E . To say that g: a — > per is a canonical transfor- 
mation means precisely that and ct scr are related as above. I 

We will be interested mainly in the special case where h(z) = 
y 2 — A 2 for some A > and s(z) depends only on x. Then the s- 
dimensional manifold 

S = {x G M s+1 I s(x) = 0} (13) 

is a potential generalized configuration space, and a has the "product" 
form 
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a = S = {x -iy G T + | x G S,y G 0+}. (14) 

The following result is physically significant in that it relates the 
pseudo-Euclidean geometry of spacetime and the symplectic geom- 
etry of classical phase space. It says that a is a phase space if and 
only if S is a (generalized) configuration space. 

Theorem 4.8. Let a = S — be as above. Then {o,a a ) is 
symplectic if and only if 

ds ds , \ 

>0, (15) 



dx^ dx/j, 

that is, if and only if S is nowhere timelike. 
Proof. On cr, we have 

d s 

{*>*} = 2^ri/"*o, (16) 

and we may assume {s, h} > without loss. For fixed x G S, the 
above inequality must hold for all y G O^", hence for all y G Vl_. This 
implies that the vector ds/dx^ is in the dual V+ of Vl_. I 

We denote the family of all u's as above (i.e., with S nowhere 
timelike) by E. It is a subfamily of So and is clearly invariant under 
Vq. Note that E admits lightlike as well as spacelike configuration 
spaces, whereas the standard theory only allows spacelike ones. 

We will now generalize the results of the last section to all o G S. 
The 2s-form a s a defines a positive measure on a, once we choose an 
orientation (Warner [1971]) for a. (This can be done, for example, by 
choosing an ordered set of vector fields on a which span the tangent 
space at each point; the order of such a basis is a generalization of 
the idea of a "right-handed" coordinate system in three dimensions.) 
The appropriate measure generalizing da of the last section is now 
defined as 

da = {s\A x y 1 al. (17) 

da is the restriction to a of a 2s-form defined on all of 7+, which we 
also denote by da. (This is a mild abuse of notation; in particular, 
the "cT here must not be confused with exterior differentiation!) By 
eq. (7), we have 

da = Al l dy^ Adx^. (18) 
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We now derive a concrete expression for da. Since s obeys eq. (15) 
and ds ^ on a, we can solve ds = (satisfied by the restriction 

of ds to a) for dx° and substitute this into dxf-- This (and a similar 
procedure for y) gives 



as ^ 


i 1 ds 


dx°j 




dh \ 


_1 dh 


dyoj 


' 5y M 



(19) 



Ty "={w)~ w/ y0={y " /y0)ry ° 



on cr. Hence 



We identify cr with IR 2s by solving s(x) =0 for x° = t (x) and 
mapping 

(t (x) - z^ + y 2 , x - iy) h- (x, y). (21) 

We further identify dy A dxo with the Lebesgue measure <i s y <i s x on 
M 2s (this amounts to choosing a non-standard orientation of IR 2s ). 
Thus we obtain an expression for da as a measure on IR s . Now 
s(x) = on a implies that 



0= A*(f(x),x) 
+ 



(22) 



cte cte fc <9x fc ' 
which can be substituted into the above expression to give 



da = AT 1 (\--9LV-\ d s yd s * 

x V dx k y° J J (23) 

= A- 1 (l-Vt-(y/y ))d s yd s x. 
But eq. (15) implies that | Vt (x) | < 1, hence for y e V+, 

|V*.(y/y )|<l (24) 
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and da is a positive measure as claimed. The above also shows that if 
| Vt (x) | = 1 for some x, then da becomes "asymptotically" degener- 
ate as | y | — > oo in the direction of Vt (x). That is, if a is lightlike at 
(t (x),x), then da becomes small as the velocity y/yo approaches the 
speed of light in the direction of Vt (x). This means that functions 
in L 2 (da) (and, in particular, as we shall see, in JC) are allowed high 
velocities in the direction Vt (x) at (t(x),x) G S. This argument 
is an example of the kind of microfocal analysis which is possible in 
the phase-space formalism. (In the usual spacetime framework, one 
cannot say anything about the velocity distribution of a function at a 
given point in spacetime, since this would require taking the Fourier 
transform and hence losing the spatial information.) 

For a G E, denote by L 2 (da) the Hilbert space of all complex- 
valued, measurable functions on a with 

\\f\\l = / da\f\ 2 <oo. (25) 

If / is a C°° function on T + , we restrict it to a and define ||/|| CT as 
above. Our goal is to show that ||/|| CT = \\f\\ic f° r every / G /C. To 
do this, we first prove that each / G K defines a conserved current in 
spacetime, which, by Stokes' theorem, makes it possible to deform the 
phase space a t> \ of the last section to an arbitrary a = S — G E 
without changing the norm. For / G /C, define 

J»(x)=A- 1 [ Ty»\f{x-iy)\\ (26) 

where O^" has the orientation defined by dy °, so that J°(x) is positive. 
Then 

ll/U* = / Tx,T{x), (27) 

where S has the orientation defined by dxo. (The restriction of dxo 
to S does not vanish since | Vt (x) | < 1.) 

Theorem 4.9. Let f(p) be C 00 with compact support. ThenJ fl (x) 
is C°° and satisfies the continuity equation 



Proof. By eq. (19), 



(28) 
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J»(x) = A? f dyy»\f(x-ty)\ 2 , 
where dy = dy °/y°. The function 



(29) 



F£(y,p,q) = y^exp[ix(p- q) -y(p + q)] f(p)f(q) 
is in L x {dy x dp x dq), hence by Fubini's theorem, 

J»(x) = A- 1 [dy f + dpdqFg{y,p,q) 



(30) 



x I dpdq exp[ix(p-q)] f(p) f(q)H tI (p + q), 

o+ y 0+ 



(31) 



where, setting k = p + q, n = yk 2 and using the recurrence relation 
for the K^s given by eq. (51) in section 4.4, we compute 



ly y^e~ yk 



d 



dk, 



-yk 



d 



dk, 



(32) 



2n\ 
V 



= k»H(n). 

H(rj) is a bounded, continuous function of r\ for r\ > 2m, and 
■/"(*) = A" 1 / dpd? exp [ix(p - ? )] /(«) (p" + O 

(33) 

Since /(p) has compact support, differentiation under the integral 
sign to any order in x gives an absolutely convergent integral, proving 
that J M is C°°. Differentiation with respect to x^ brings down the 
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factor i(p fI — q^) from the exponent, hence the continuity equation 
follows from p 2 = q 2 = m 2 . I 

Remark. The continuity equation also follows from a more intuitive, 
geometric argument. Let 

B+ = {y G VI | y° > ^X 2 +y 2 }, (34) 
oriented such that 

0+ = -dB+ (35) 

(the outward normal on dB^ points "down," whereas is oriented 
"up"). Then by Stokes' theorem, 



J^x) = A~ 1 [ dy»\f(x-iy) 



2 



= -A~ x x J B+ d(dy»\f{x-iy)\ 2 ). 



(36) 



Here, d represents exterior differentiation with respect to y, and since 
the s-form dy M contains all the dy^s except for dy^, we have 

J"(x) = -A- 1 dy A | f(x - iy) | 2 , (37) 

where dy is Lebesgue measure on B+. To justify the use of Stokes' 
theorem, it must be shown that the contribution from | y | — > oo 
to the first integral vanishes. This depends on the behavior of f(z), 
which is why we have given the previous analytic proof using the 
Fourier transform. Then the continuity equation is obtained by dif- 
ferentiating under the integral sign (which must also be justified) and 
using 

d 2 | / | 2 



which follows from the Klein-Gordon equation combined with ana- 
lyticity, since 
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dx^dy^ 



d 
d 2 



d 

dz» 



— %- 







d 



dz,, dz, 



d 2 



(39) 



dzvdz^ dz^dz^ 



= i{U- z -U z ). 
Incidentally, this shows that 

d 



/(*) 



= i 



f(z)d„f(z)-dj(z).f(z) 



(40) 



is a "microlocal," spacetime-conserved probability current for each 
fixed y G V+, so the scalar function | f(z) \ 2 is a potential for the 
probability current. We shall see that this is a general trend in the 
holomorphic formalism: many vector and tensor quantities can be 
derived from scalar potentials. 

Eqs. (37) and (40) also show that our probability current is a 
regularized version of the usual current associated with solutions in 
real spacetime. The latter (Itzykson and Zuber [1980]) is given by 



» 



which leads to a conceptual problem since the time component, which 
should serve as a probability density, can become negative even for 
positive-energy solutions (Gerlach, Gomes and Petzold [1967], Barut 
and Malin [1968]). By contrast, eq. (36) shows that J (x) is stricly 
non-negative. The tendency of quantities in complex spacetime to 
give regularizations of their counterparts in real spacetime is further 
discussed in chapter 5. 

We can now prove the main result of this section. 
Theorem 4.10. Let a = S - zO+ G E and f e K . Then Wf]^ = 
ll/lk- 

Proof. We will prove the theorem for /(p) in the space U(1R S ) of 
C°° functions with compact support, which implies it for arbitrary 
/ G L\ (dp) by continuity. Let S be given by x° = t (x) , and for 
R > let 



138 



4. Complex Spacetime 



D R = 


{x 


GlR s+1 | 


X 


< R,x° 


G[0,t(x)]}, 


Er = 


{x 


GlR s+1 | 


X 


= R,x° 


G[0,t(x)]}, 


Sor = 


{x 


GlR s+1 | 


X 


< R,x° 


= 0}, 


Sr = 


{x 


GlR s+1 [ 


X 


< R,x° 





(41) 



where [0, t (x)] means [t (x), 0] if t (x) < 0. We orient Sqr and Sr by 
dxo, Er by the "outward normal" 

s 

f = R~ 1 ^x k dx k , (42) 
k=i 

and D R so that dD R = Sr-S or + E r . Now let /(p) G V(M S ). Then 
J^(x) is C°°, hence by Stokes' theorem, 



/ J^{x)dx^= I d(j»dx^ 

J Sr — Sqr+Er J Dji 

= ("1) S / 

Jd r 



dx» 



(43) 



We will show that 
A 



(R) = I J^dx^ -> as i? -> oo (44) 

J En 



En 

(i.e., there is no leakage to |x| — > oo), which implies that 



2 = lim / J^dx, 

Sr 



R^oo ' 



R^ca 



lim / J M cb M 



(45) 



by theorem 1 of section 4.4. To prove that A(R) — > 0, note that on 
Er, dxQ = and 

dx k = x k = x k = ---=x k , (46) 

X\ x 2 x s 
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each form being defined except on a set of measure zero; hence 



v = R d ^. (47) 



Byeq. (29), 



hence 



\J\x)\< J°(x), (48) 



l A (^)l=IE/ Jk dx* 

k=x Je r 



HE/ I 

fc=i ,/B « Xl (49) 

<s [ J°R^ 

J E R Xx 



En 

Now by eqs. (31) and (32), 



= s I J°r = a(R). 



J°{x)= I d s pd s qe ll(p - 9 V(p,q), 



where 



(50) 



0(p,q) = /(p)/(«)V'(P,q), ( 51 ) 
and V e C°°(IR 2s ). Hence <p G £>(IR 2s ). Let 

D = x-V p , (52) 
where x = x/i?, and observe that for x e -Er, 

D« = -//?( 1 - -Cx- v ) r" 7 ' 



i? 7 (53) 

where v = p/po- Since <fi has compact support, there exists a constant 
a < 1 such that |v| < a and |v'| < a for all (p, p') in the support 
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of 4>. Furthermore, since |Vt(x)| < 1, given any e > we have 
< -R(l + e) for £ G E R for R sufficiently large; hence 



> 1 — a(l + e) for x G E R and p G supp <fi. 
Choose 0<e<Q! -1 — 1, substitute 



De ixp , xeE R 



into the expression for J°(x) and integrate by parts: 

J°(x) = (iR)- 1 [ d s pd s qe ix ^ D ( 

= (iR)' 1 [ rf s prf s qe-^)^(p,q). 
Jtr 2s 

This procedure can be continued, giving (for x G E R ) 



(54) 



(55) 



(56) 



J°(x) = (iR)~ n [ d'pd'qe**^ 4 n) (P,q). 
Jm 2s 



n = 1,2, 



where 



(n) 



(Mj^Doff^q) 



x-V p (l-^x.v 



0(p,q)- 



(57) 



(58) 



Now (D o £ x ) n is a partial differential operator in p whose coeffi- 
cients are polynomials in D k (£~ 1 ) with k = 0, 1, • • • , n. We will show 
that for x G E R with R sufficiently large, there are constants bk such 
that 

\D k {r l )\<b k , fc = 0,l,--., (59) 

which implies that 

ll0i n) l|Li(n 2 -) < Cn, x G E R , n = 1, 2, • • • 



(60) 
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for some constants c n , so that by eqs (49) and (57), 

a(R) = s I J°r 

J En 

=<sR~ n [ ||^|| Ll(ni2s) f(x) 
— > as R — > oo 

if we choose n > s. To prove eq. (59), note that it holds for k 
eq. (54) and let u = x • v. Then 

1 - u 2 

Du = {±- V p ) (x • p/po) = , 

and if for some 



Po 



where Pk is a constant-coefficient polynomial, then 

D k+i u = P'MDu _ kP k (u) 
Po Po +1 
Pk+i(u) 

Po 

hence eq. (63) holds for k = 1, 2, • • • by induction. Thus 

it R Po 



which implies 



|^|< ^ max|P fc (u)|. 
1 1 m fc M<r 1 
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But D k (^~ 1 ) is a polynomial in £ _1 and D 2 ^, • • • , D k £; hence eq. 
(59) follows from eqs. (54) and (66). I 

The following is an immediate consequence of the above theorem. 
Corollary 4.11. 

(a) For every o G S, the form 

{h\h).= [ dah{z)h{z) (67) 

defines a Vo-invariant inner product on JC, under which JC is a 
Hilbert space. 

(b) The transformations (U g f)(z) = f(g~ l z), g G Vq, form a unitary- 
irreducible representation of Vq under the above inner product, 
and the map f i— > / from L 2 + (df>) to fC intertwines this represen- 
tation with the usual one on L 2 ^(dp). 

(c) For each a G S, we have the resolution of unity 

da\e z )(e z \ =1 (68) 

on L^dp) (or, equivalently, on /C if e z is replaced by e z . ) I 
Note: As in section 4.4, all the above results extend by continuity to 
the case A = 0. # 



4.6. The Non— Relativistic Limit 

We now show that in the non-relativistic limit c — > oo, the foregoing 
coherent-state representation of Vq reduces to the representation of 
Qi derived in section 4.3, in a certain sense to be made precise. As a 
by-product, we discover that the Gaussian weight function associated 
with the latter representation (hence also the closely related weight 
function associated with the canonical coherent states) has its origin 
in the geometry of the relativistic (dual) "momentum space" 
That is, for large |y| the solutions in K, are dampened by the factor 

exp[— \/\ 2 + y 2 cu] in momentum space, which in the non-relativistic 
limit amounts to having a Gaussian weight function in phase space. 

In considering the non-relativistic limit, we make all dependence 
on c explicit but set h = 1. Also, it is convenient to choose a coordi- 
nate system in which the spacetime metric is g = diag(l, — 1, . . . , — 1), 
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so that y° = yo = a/A 2 + y 2 and p° 
and let A = uc. Then 



Po 



\Jm 2 c 2 + p 2 . Fix it > 



you 



\J u 2 c 2 + y 2 \J m 2 c 2 + p 2 



time + 



2 , up , my' _ 2 



+ 



+ 0(c" 2 ). 



(1) 



2m 2w 

Working heuristically at first, we expect that for large c, holomorphic 
solutions of the Klein-Gordon equation can be approximated by 



f(x -iy)= / dp exp [-itu + zx • p - y u + y ■ p] /(p) 



cfp 



(2tt) 2 • 2mc 
exp 



exp 



— it I mc 2 + 



2 U P 

—umc — 



my' 



2m 



+ y • p 



+ ix • p 



/(P) 



x 



(2) 



2m 2-u 

~ (2mc) _1 exp [-irmc 2 - my 2 /2w] / NR (x - iy, t), 

where r = t — iu and / NR is the corresponding holomorphic solution of 
the Schrodinger equation defined in section 4.3. Note that the Gaus- 
sian factor exp[— my 2 /2u] is the square root of the weight function 

for the Galilean coherent states, hence if we choose /(p) G L 2 (IR S ) C 
L+(dp), then 

|| e -m y 2 /2« /NR ||2 2((cs) = {nu/m) s ||/nr|| ^ < QO _ (3) 

We now rigorously justify the above heuristic argument. Let f(z) 

be the function in K corresponding to /(p) and denote by f c its 
restriction to x° = t and y 2 = u 2 c 2 , for fixed u > 0. 

Theorem 4.12. Let u > and /(p) € L 2 (R S ). Then 



J(c) 



\\2mce lTmc 
as c 



fc 



-my 



" /2u f 



nr||l 2 ((D s ) 



oo. 



Proof. Without loss of generality, we set u 
simplify the notation. Note first of all that 



m 



|2ce c /, 



c- f i|2 

c\\L 2 ((S s ) 



4c 2 e 2c A c 
= 4c 2 e 2c2 A r 



(4) 

1 and t = to 

(5) 
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where A c = A\ (A = uc = c). But 

A c = it v K u+1 (2c 2 ) 

= (2c)- 1 W 2 e- 2c2 [l + 0(c- 2 )] 



(6) 



and 



Thus 



< (2c)- 1 (2n)-V\\h i ^y 



(7) 



l|2ce c2 /c||! 2( ^) < (^r s/2 \\f\\h { Mn [1 + 0(c" 2 )] , (8) 
showing that 2ce c2 f c approaches a limit in L 2 ((E S ) as c — > oo. Now 



J(c) = /Y d'xd»y|[( 



c e c 2 -y P _ e -( P -y) 2 /2' 



) ; 1 ~ W 



2 I d s v( - P c2 -yp - e -(p-y) 2 /2' 



(9) 



d s p\f(p)\ 2 I d s y[-e 



Choose a, 7 such that 1/2 < 7 < a < 1. Then 



Ji 



'/"Pl/(P)| 2 / rf s y(-e c2 -^-e-^ 2 / 2 )' 



|p|>c!- a ./IR 

<47r s / 2 || Xc /||! 2(RS) 
— > as c — > 00, 



(10) 



where Xc is the indicator function of the set {p | |p| > c 1 a }. Define 

9 and <p by |y| = csinh# and |p| = csinh(/>. Then y° = cosh# and 

10 = c 2 cosh 0, hence 



> c 2 cosh(# - 0) > c 2 [1 + (0 - 4>f/2] 
Thus for arbitrary a > 0, 



(11) 
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7a(p)= / d s ye 2c2 " 2w 

J Ivl >c cosh a 



|y I >c cosh a 

" r(a/2) J a 

ol-s„s_s/.2 /-oo 

< ' 1, / dff e^ 1 )* (e e + e~°) e~^~^ (12) 

ol-s s s/.2 /-oo 

" r( a /2) A 

ol-s s s/. 2 /-oo 



r( s /2) 



c(a—(f>) — s/2c 



Let a = sinh 1 (c 7 ). Then for |p| < c 1 Q , 



c(a - 0) - s/2c > c [sinh -1 ^" 7 ) - sinh" 1 ^- )] - s/2c = g(c). (13) 

g(c) is independent of p and g(c) ~ c 1-7 as c — > oo. Also, < c _a 
when |p| < c 1_a . Hence 



e 2c 2 -2yp 



^2 = / d"p|/(p)| 2 / d s y 

^IpKc 1 -" •/|y|>c 1 -T 

o2-s s-1 s/2 /-oo 



— > as c — > oo. 

Now 

2c 2 - 2yp = y 2 + p 2 - 2yp = (y- p) 2 

= (yo-^) 2 /c 2 -(y-p) 2 >-(y-p) 1 

Hence 



(15) 



/ efp|/( P )| 2 / d s ye~^ 2 <J 2 , (16) 
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and 



/ ef p |/(p)| 2 / ef y (-e c2 -^ - e -(P-y) 2 /2V 



\y\>c 

< 4 Jo — > as c — > oo. 



Finally, 
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^_ e <?~W _ e _( y _p)2 /2 



d s p\f(p)\ 2 f d s y( 
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where 
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<^|y 2 -p 2 l 



< c" 2 + 
We have used the estimate 



V 7 ! + w 2 - V 7 ! + ^ 2 
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12 2 1 



Hence for sufficiently large c and |p| < c 1 Q , 

] C c 2 8 2 /2 



UJ 



< (1 + 2c 2 5 2 ) + 1-2(1- p 2 /2c 2 ) e^ 2 

< 2 (l - e^ 2 / 2 ) + 2c 2 5 2 + c-^e cH2 l 2 

< 2c 2 5 2 + c" 2q (1 + c 2 5 2 ) 
= h(c) — > as c — > oo. 
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Thus 



Js<Hc) [ d s p\f(p)\ 2 [ cfye" (y - p)2 

J\p\<c 1 ~ a ■/|y|<c 1 -T 



<h(c)^\\f\\l^) (22) 

— > as c — > oo, 
which proves that J(c) — > as c — > oo. I 



Notes 

This chapter represents the main body of the author's mathematics 
thesis at the University of Toronto (Kaiser [1977c]). All the theorems, 
corollaries, lemmas and propositions (labeled 4.1-4.12) have appeared 
in the literature (Kaiser [1977b, 1978a]). In 1966, when the idea of 
complex spacetime as a unification of spacetime and phase space first 
occurred to me, I had found a kind of frame in which both the bras 
and the kets were holomorphic in z and the resolution of unity was 
obtained by a contour integral, using Cauchy's theorem. During a 
seminar I gave in 1971 at Carleton University in Ottawa (where I was 
then a post-doctoral fellow in physics), L. Resnick pointed out to me 
that this "wave-packet representation" appeared to be related to the 
coherent-state representation, which was at that time unknown to 
me. The kets were identical to the canonical coherent states, but the 
bras were not their Riesz duals; in the language of chapter 1, they 
belonged to a (generalized) frame reciprocal to that of the kets, and 
the resolution of unity was of the type given by eq. (24) in section 1.3, 
which may be called a continuous version of biorthogonality. A version 
of this result was reported at a conference in Marseille (Kaiser [1974]). 
I was later informed by J. R. Klauder that a similar representation had 
been developed by Dirac in connection with quantum electrodynamics 
(Dirac [1943, 1946]). 

The original idea of complex spacetime as phase space was to con- 
sider a complex combination of the (symmetric) Lorentzian metric 
with the (antisymmetric) symplectic structure of phase space, obtain- 
ing a hermitian metric on the complex spacetime parametrized by lo- 
cal coordinates of the type x+ibp. (I have since learned that this struc- 
ture, augmented by some technical conditions, is known as a Kahler 



148 



4. Complex Spacetime 



metric; see Wells [1980].) The above "wave-packet representation" 
indicated that this combination may in fact be interesting, but so far 
it was ad hoc and lacked a physical basis. Also, the representation was 
non-relativistic, and it was not at all clear how to extend it to the rel- 
ativistic domain, as pointed out to me by V. Bargmann in 1975. The 
standard method of arriving at canonical coherent states is to use an 
integral transform with a Gaussian kernel in the configuration-space 
representation, and there is no obvious relativistic candidate for such a 
kernel. The more general methods described in chapter 3 do not work, 
since the representations of interest are not square-integrable (section 
4.3). An important clue came in 1974 from the study of axiomatic 
quantum field theory, where I was fascinated by the appearance of 
tube domains. These domains occur in connection with the analytic 
continuation of vacuum expectation values of products of fields, and 
are therefore extensions of such products to complex spcetime. How- 
ever, the complexified spacetimes themselves are not taken seriously 
as possible arenas for physics. They are merely used to justify the 
application of powerful methods from the theory of several complex 
variables, in order to obtain results concerning the restrictions of vac- 
uum expectaion values to reaJ spacetime. (However, the restrictions 
to Euclidean spacetime do have important consequences for statistical 
mechanics; see Glimm and Jaffe [1981].) I felt that if these tube do- 
mains could somehow be given a physical interpretation as extended 
classical phase spaces, this would give the phase-space formulation 
of relativistic quantum mechanics a firm physical foundation, since in 
quantum field theory the extension to complex spacetime is based on 
solid physical principles such as the spectral condition. This idea was 
first worked out at the level of non-relativistic quantum mechanics, 
leading to the representation of the Galilean group given in section 
4.3. That amounted to a reformulation of the canonical coherent- 
state representation in which the Gaussian kernel appears naturally 
in the momentum representation, as a result of the analytic continu- 
ation of solutions of the Schrodinger equation. This "explained" the 
combination x + ibp (section 4.3, eq. (5)) and gave the coherent-state 
representation a dynamical significance. It also cleared the way to 
the construction of relativistic coherent states, since now the Gaus- 
sian kernel merely had to be replaced with the analytic Fourier kernel 
e -iz P on ^ e mass shell. An important tool was the use of groups to 
compute certain invariant integrals, which I learned from a lecture by 
E. Stein on Hardy spaces in 1975. The construction of the relativistic 
coherent states given in sections 4.4 and 4.5 was carried out in 1975- 
76, culminating in the 1977 thesis. Related results were announced at 
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a conference in 1976 (Kaiser [1977a]) and at two conferences in 1977 
(Kaiser [1977d, 1978b]). To my knowledge, this was the first success- 
ful formulation of relativistic coherent states, which have since then 
gained some popularity (see De Bievre [1989], Ali and Antoine [1989]). 
An earlier attempt to formulate such states was made by Prugovecki 
[1976] , but this was shown to be inadequate since the proposed states 
were merely the Gaussian canonical coherent states in disguise, hence 
not covariant under the Poincare group (Kaiser [1977c], remark 4 in 
sec. II. 5 and addendum, p. 133.) After the results of the thesis ap- 
peared in the literature, Prugovecki [1978; see also 1984] discovered 
that they can be generalized by replacing the invariant functions e~ yp 
with arbitrary (sufficiently regular) invariant functions. The price of 
this generalization is that solutions of the Klein-Gordon equation are 
no longer represented by holomorphic functions and the close connec- 
tion with quantum field theory (chapter 5) appears to be lost. The 
relation between the two formalisms and their history was discussed 
at a conference in Boulder in 1983 (Kaiser [1984b]), where an incon- 
sistency in Prugovecki 's formalism was also pointed out. 

The classical limit of solutions of the Klein-Gordon equation in 
the coherent-state representation was studied in Kaiser [1979]. In 
an effort to understand interactions, the notion of holomorphic gauge 
theory was introduced (Kaiser [1980a, 1981]). This is reviewed in 
section 6.1. An early attempt was also made to extend the theory 
to the framework of interacting quantum fields (Kaiser [1980b]), but 
that was soon abandoned as unsatisfactory. A more promising ap- 
proach was developed later (Kaiser [1987a]) and is presented in the 
next chapter. 

Note that our phase spaces cr are not unique, since the configura- 
tion space S can be chosen arbitrarily as long as it is nowhere timelike 
and A > can be chosen arbitrarily. The freedom in S is, in fact, re- 
lated to the probability-current conservation, while the freedom in 
A, combined with holomorphy, allowed us to express the probability 
current as a regularization of the usual current by the use of Stokes' 
theorem (section 4.5, eqs. (37) and (40)). By contrast, the phase 
spaces obtained by De Bievre [1989] are unique. They are "coadjoint 
orbits" of the Poincare group, related to "geometric quantization" 
theory (Kirillov [1976], Kostant [1970], Souriau [1970]). Although 
this uniqueness seems attractive, it involves a high cost: the dynam- 
ics must be factored out. This means that the ensuing theory is no 
longer "local in time." Since one of the attractions of coherent-state 
representations is their "pseudo-locality" in both space and momen- 
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turn, and since in a relativistic theory time ought to be treated like 
space, it seems to me an advantage to retain time in the theory. Per- 
haps a more persuasive argument for this comes from holomorphic 
gauge theory (section 6.1), where a theory describing a free particle 
can, in principle, be "perturbed" by introducing a non-trivial fiber 
metric to obtain a theory describing a particle in an electromagnetic 
(or Yang-Mills) field. This cannot be done in a natural way once time 
has been factored out. 

Some very interesting work done recently by Unterberger [1988] 
uses coherent states which are essentially equivalent to ours to de- 
velop a pseudodifferential calculus based upon the Poincare group as 
an alternative to the usual Weyl calculus, which is based on the Weyl- 
Heisenberg group. Since the Poincare group contracts to a group con- 
taining the Weyl-Heisenberg group in the non-relativistic limit (sec- 
tion 4.2), Unterberger's "Klein-Gordon calculus" similarly contracts 
to the Weyl calculus. 
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5.1. Introduction 

We have regarded solutions of the Klein-Gordon equation as the quan- 
tum states of a relativistic particle. But such solutions also possess 
another interpretation: they can be viewed as classical fields, some- 
thing like the electromagnetic field (whose components, in fact, sat- 
isfy the wave equation, which is the Klein-Gordon equation with zero 
mass). This interpretation is the basis for quantum field theory. The 
general idea is that just as the finite number of degrees of freedom of a 
system of classical particles was quantized to give ordinary ( "point" ) 
quantum mechanics, a similar prescription can be used to quantize the 
infinite number of degrees of freedom of a classical field. It turns out 
that the resulting theory implies the existence of particles. In fact, the 
asymptotic free in- and out-fields are represented by operators which 
create and destroy particles and antiparticles, in agreement with the 
fact that such creation and destruction processes occur in nature. 
These particles and antiparticles are represented by positive-energy 
solutions of the asymptotic free wave equation, e.g. the Klein-Gordon 
or Dirac equation. Thus the formalism of relativistic quantum me- 
chanics appears to be, at least partially, absorbed into quantum field 
theory. 

In regarding solutions of the Klein-Gordon equation as the phys- 
ical states of a relativistic particle, it was appropriate to restrict our 
attention to functions having only positive-frequency Fourier com- 
ponents, since the energy of the particle must be positive. Even a 
small negative energy can be made arbitrarily large and negative by 
a Lorentz transformation, leading to instability. When the solutions 
are regarded as classical fields, however, no such restriction on the 
frequency is necessary or even justifiable. For example, in the case 
of a neutral field (i.e., one not carrying any electric charge), the 
solutions must be real- valued, hence their Fourier transforms must 
contain negative- as well as positive-frequency components. On the 
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other hand, the analytic extension of the solutions to complex space- 
time appeared to depend crucially on the positivity of the energy. We 
must therefore ask whether an extension is still possible for fields, or 
if it is even desirable from a physical standpoint, since the connec- 
tion between solutions and particles is not as immediate as it was 
earlier. In this chapter we find an affirmative answer to both of these 
questions. A natural method, which we call the Analytic-Signal trans- 
form, will be developed to extend arbitrary functions from IR S+1 to 
(E s+1 , and when the functions represent physical fields, the double 
tube T = T + U 71 in (E s+1 will be shown to have a direct physical sig- 
nificance as an extended classical phase space, not for the fields them- 
selves but for certain "particle" - and "antiparticle" coherent states ef 
associated with them. These states are related directly to the dynami- 
cal (interpolating) fields, not their asymptotic free in- and out-fields. 
To be precise, they should be called charge coherent states rather 
than particle coherent states, since they have a well-defined charge 
whereas, in general, the concept of individual particles does not make 
sense while interactions are present. If the given fields satisfy some 
(possibly non-linear) equations, the coherent states satisfy a Klein- 
Gordon equation with a source term. Hence they represent dynamical 
rather than "bare" particles. For free fields, e+ reduces to the state e z 
defined in the last chapter and e~ to its complex conjugate, which is a 
negative-energy solution of the Klein-Gordon equation holomorphic 
inT_. 

Complex tube domains also appear in the contexts of axiomatic 
and constructive quantum field theory, and our results suggest that 
those domains, too, may have interpretations related to classical phase 
space, a point of view which, to my knowledge, has not been explored 
heretofore.* 

While our extended fields are not analytic in general, they are 
"analyticity-friendly," i.e. have certain features which yield various 
analytic objects under different circumstances. For example, their 
two-point functions are piecewise analytic, and the pieces agree with 
the analytic Wightman functions. In the special case when the given 
fields are free, the extended fields themselves are analytic in T. Fur- 
thermore, the fields in general possess a directional analyticity which 



* R. F. Streater has recently told me that G. Kallen was infor- 
mally advocating the interpretation of the holomorphic Wightman 
two-point function as a correlation function in phase space around 
1957. Nothing appears to have been published on this, however. 
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looks like a covariant version of analyticity in time. Since the lat- 
ter forms the basis for the continuation of the theory (in the form of 
vacuum expectaion values) from Lorentzian to Euclidean spacetime 
(see Nelson [1973a, b] and Glimm and Jaffe [1981]), it may be that our 
extended fields, when restricted to the Euclidean region, bear some 
relation to the corresponding Euclidean fields. 

The formalism we are about to develop for fields is a natural 
extension of the one constructed for particles in the last chapter. Like 
its predecessor, it posesses a degree of regularity not found in the 
usual spacetime formalism. Some examples of this regularity are: 

(a) The extended fields <f>(z) are, under reasonable assumptions, oper- 
ator-valued functions (rather than distributions, as usual) when 
restricted to T. 

(b) The theory contains a natural, covariant ultraviolet damping, 
which is a permanent feature of the theory. This comes from 
the possibility of working directly in phase space, away from real 
spacetime. From the point of view of the usual (reai spacetime) 
theory, our formalism looks like a "regularization" . From our 
point of view, however, no regularization is necessary since, it 
is suggested, reality takes place in complex spacetime! In other 
words, this "regularization" is permanent and is not to be re- 
garded as a kind of trick, used to obtain finite quantities, which 
must later be removed from the theory. 

(c) In the case of free fields, the formalism automatically avoids zero- 
point energies without normal ordering, due to a polarization of 
the positive- and negative frequency components into the forward 
and backward tubes, respectively. Observables such as charge, 
energy-momentum and angular momentum are obtained as con- 
served integrals of bilinear expressions in the fields over phase 
spaces a C T. These expressions, which are densities for the 
corresponding observables, look like regularizations of the corre- 
sponding expressions in the usual spacetime theory. The analytic 
(Wightman) two-point function acts as a reproducing kernel for 
the fields, much as it did for the wave functions in chapter 4. 

(d) The particles and antiparticles associated with the free Dirac field 
do not undergo the random motion known as Zitterbewegung 
(Messiah [1963]), again because of the aforementioned polariza- 
tion. 
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5.2. The Multivariate Analytic— Signal Transform 

As mentioned above, in dealing with physical fields such as the electro- 
magnetic field, rather than quantum states, we can no longer justify 
the restriction that frequencies must be positive. For one thing, as we 
shall see, in the presence of interactions there is no longer a covari- 
ant way to eliminate negative frequencies. Hence the method used in 
chapter 4 to analytically continue solutions of the Klein-Gordon equa- 
tion to complex spacetime will no longer work directly. In this section 
we devise a method for extending arbitrary functions from III s " 1 " 1 to 
(E s+1 . When the given functions are positive-energy solutions of the 
Klein-Gordon or the Wave equation, this method reduces to the an- 
alytic extension used in chapter 4. But it is much more general, and 
will enable us to extend quantized fields, whether Bose or Fermi, in- 
teracting or free, to complex spacetime. We begin by formulating the 
method for functions of one variable, where it is closely related to the 
concept of analytic signals. For motivational purposes, we think of 
the variable as time (s = 0). In this chapter, Fourier transforms will 
usually be with respect to spacetime (IR S+ ) rather than just space 

(IR S ). Hence we will denote them by /, reserving / for the spatial 
Fourier transform, as done so far. 

Suppose we are given a "time-signal," i.e. a real- or complex- 
valued function of a single real variable x. To begin with, assume that 
/ is a Schwartz test function, although most of our considerations will 
extend to certain kinds of distributions. Consider the positive- and 
negative- frequency parts of /, defined by 



Then / + and /_ extend analytically to the lower-half and upper-half 
complex planes, respectively, i.e. 




(1) 




(2) 
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/+ and /_ are just the Fourier-Laplace transforms of the restrictions 
of / to the positive and negative frequencies. 

If / is complex- valued, then / + and /_ are independent and the 
original signal can be recovered from them as 

f(x) = ]im\f+(x-iy) + f-(x + iy)]. (3) 
yio 

If / is real-valued, then 



Kp) = K-p), (4) 

hence / + and /_ are related by reflection, 



/+(*) = /_(*), zGflT, (5) 

and 

f(x) = \im2ftf + (x-iy) = lim2K/_(x + iy). (6) 

ylO ylO 

When / is real, the function f+(z) is known as the analytic signal 
associated with f(x). A complex-valued signal would have two in- 
dependent associated analytic signals /+ and /_. What significance 
do f± have? For one thing, they are regularizations of /. The above 
equation states that / is jointly a boundary-value of the pair / + 
and /_. As such, / may actually be quite singular while remaining 
the boundary-value of analytic functions. Also, f± provide a kind 
of "envelope" description of / (see Klauder and Sudarshan [1968], 
section 1.2). For example, if f(x) = cos ax (a > 0), then f±(z) = 
| exp(=|=ia,2), so the boundary values are f±(x) = \ exp(=)=iax). 

In order to extend the concept of analytic signals to more than 
one dimension, let us first of all unify the definitions of f+ and /_ by 
defining 

/oo 
dp6(yp)e-^-^f(p) (7) 
-oo 

for arbitrary x — iy G (E, where 9 is the unit step function, defined by 

r0, u < 

9{u) = < 1/2, u = (8) 
U, u>0. 
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Then we have 

(/+(*), y>o 

f(z)={±f(x), y = (9) 

U-(*), y<o. 

[The apparent inconsistency f(x) = \f{x) for y = is due to a mild 
abuse of notation. It could be removed by redefining f(z) by a factor 
of 2 or, more correctly but laboriously, rewriting it as (Sf)(z). We 
prefer the above notation, since the boundary-values f(x) will not 
actually be used in the phase-space formalism.] 
Let us define the exponential step function by 

0< = 0(-&C) e <, (10) 
so that our extension is given by 

/oo 
d P 9-^f(p). (11) 
-oo 

The identity 

9(u)9{u') = 9{uu')9{u + u') (12) 
shows that 9^ has the "pseudo-exponential" property 

e c e c ' = 9(^c^C')o c+c \ (13) 

which will be useful later. 

Although this unification of / + and /_ may at first appear to be 
somewhat artificial, we shall now see that it is actually very natural. 
Note first of all that for any real u, we have 

e( u )e- u = — -^—e lTU , (14) 

since the contour on the right-hand side may be closed in the lower 
half-plane when u < and in the upper half-plane when u > 0. For 
u = 0, the equation states that 

m 1 r(r + ,)dr 



^ij-oo r 2 + l 

i r ir i < 16 > 



2tt t' + 1 2 
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in agreement with our definition, if we interpret the integral as the 
limit as L — > oo of the integral from — L to L. The exponential step 
function therefore has the integral representation 

1 POO J 

0-i(x-iy)p _ 1 / ttT e -i(x-ry)p /^gN 

2mJ_ 00 T-i 

If this is substituted into our expression for f(z) and the order of 
integrations on r and p is exchanged, we obtain 

I f 00 d>T 

f(x-iy) = — ;f(x-ry) (17) 

2m J_ oc t - i 

for arbitrary x — iy G (E. We shall refer to the right-hand side as the 
Analytic-Signal transform of f(x). It bears a close relation to the 
Hilbert transform, which is defined by 

1 Z" 00 dii 

(Hf)(x) = -PV -f(x-u), (18) 

where PV denotes the principal value of the integral. Consider the 
complex combination 



f( x )-i(Hf)(x) = - Id 

J-OO 

= — lim / 

Til ej.0 J_ 



iri5{u) +PV- 



u 



f(x - u) 



du 



oo 
oo 



u — te 

= — lim / — — 

iri 40 J_ 00 r - i 

= lim 2f(x — ie). 

e|0 



- f(x - u) 



- f(x - re) 



(19) 



Similarly, 



Hence 



f(x)+i(Hf)(x)=]im 2f(x + ie). 

e|0 



(20) 



(Hf)(x) = -i lim [f{x + ie) - /(a; - ie)], 

ej.0 



(21) 



158 



5. Quantized Fields 



which, for real-valued /, reduces to 



(Hf)(x) = lim 2%f(x + ie) 



40 



lim 2Qf(x — ie). 

e|0 



(22) 



We are now ready to generalize the idea of analytic signals to an 
arbitrary number of dimensions. 

Definition. Let / E S(IR S+1 ). The Analytic-Signal transform of / 
is defined by 

/(x- i „) = -/_ oo ^/(x-r„). (23) 
The same argument as above shows that 



We shall refer to the right-hand side of this equation as the (inverse) 
Fourier-Laplace transform of f in the half-space 



The integral converges absolutely whenever / e L 1 (IR S ), since 
\9~ %zp \ < 1. Hence f(z) can actually be defined for some distribu- 
tions, not only for test functions. The extension of the transform 
to distributions is complicated by the fact that 0~ tzp is not a test 
function in the variable p, hence for a tempered distribution T, 



(defined through the Fourier transform T of T) may not make sense 
as a function on (D s+1 . It would be intersting to find a natural class 
of distributions for which T(z) does make sense as a function. In 
general, however, it may be necessary to consider distributions T such 
that T(z) is some kind of distribution on (E s+1 . The solutions to 
both of these problems are unknown to me, so a certain amount of 
vagueness will be necessary on this point. In the next section, where 
T is a quantized field, it will be assumed to satisfy some physically 
reasonable conditions which imply that T(z) is well-defined in an 
important subset T of (C s+1 . 




(24) 



M y = {pe H 



VP > 0}. 



(25) 



T(z) = T(6- tzp ) 



(26) 



5.2. The Multivariate Analytic-Signal Transform 
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Recall that for s = 0, f(z) was analytic in the upper- and lower- 
half-planes. In more than one dimension, f(z) need not be analytic, 
even though, for brevity, we still write it as a function of z rather 
than z and z. However, f(z) does in general possess a partial ana- 
lyticity which reduces to the above when s = 0. Consider the partial 
derivative of f(x — iy) with respect to z M , defined by 

df df ( 27 ) 

i- 



dx^ dyv 

Then / is analytic at z if and only if d^f = for all [i. But using our 
definition of f(z), we find that 

2d,f(x -iy) = ^ J^dr^ix-ry). (28) 

The right-hand side is known as the X-Ray transform (Helgason 
[1984]) of the function df/dx^ 1 , given in terms of the parameters x 
and y defining the line xir) = x — ry. It follows that the complex 
d-derivative in the direction of y vanishes, i.e. 

Any^d^f(z) = J°° dry^(x - ry) 

r a 9 ft \ ( 29 ) 

= - j_Jr-f{x-ry) 
= 0, 

if / decays for large x (e.g., if / is a test function). Equivalently, using 
2«9 M [6(yp)e- iz *] =2^ [6(yp)] e~ iz * 



_ . d9{yp) izp 



dy» (30) 
= -ip^5{yp) e~ lzp 
= -ip f ,d(yp)e- ixp , 

we have 

2idJ(z) = (2n)- s - 1 [ dpp^(yp)e-^f(p), (31) 
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so Anid^f is the inverse Fourier transform of p lx f{p) in the hyperplane 

N y = {p G M s+1 \yp = 0} = 8M y . (32) 

Hence, if the intersection of the support of / with N y has positive 
Lebesgue measure in N y , then / will not be analytic at x — iy in 
general. However, in any case, 



2ty%f(z) = (2n)- s - 1 [ dp yp8(yp) f(p) = 0, (33) 

in agreement with the above conclusion. In the one-dimensional case 
s = 0, this reduces to 

^=0 Vy^O, (34) 

which states that f(z) is analytic in the upper- and lower- half- 
planes. The point is that in one dimension, there are only two imagi- 
nary directions (up or down), whereas in s + 1 dimensions, every y ^ 
defines a direction. This motivates the following. 
Definition. Let Y{z) be a vector field of type (0,1) on (E s+1 , i.e. 
Y = Y fJ, (z) <9 M . Then a function / on (D s+1 is holomorphic along Y if 

Yf(z)=Y^(z)dJ(z) = 0. (35) 

Thus, our Analytic-Signal transform f(z) is holomorphic along 

Y(x - iy) = 

Note: The "functorial" way to look at f(z) is as an extension of f(x) 
to the tangent bundle T(IR S+1 ). Then the above states that f(z) 
is holomorphic along the fibers of this bundle. It makes sense that 
f(z) ought to satisfy some constraints, since it is determined by a 
function f(x) depending on half the number of variables. If IR S+ is 
replaced by a differentiable manifold, the line x(r) = x — ry would 
have to be replaced by a geodesic. This gives a generalized transform 
which can be used to extend functions on an arbitrary Riemannian 
or Pseudo-Riemannian manifold, such as a curved spacetime, to its 
tangent bundle. # 

The multivariate Analytic-Signal transform is related to the Hil- 
bert transform in the direction y (Stein [1970], p. 49), defined as 
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1 f°° clii 

(H y f)(x) = -PV -f( x - uy ), x ,yeM s+ \y^O. (36) 

7T J-oo w 

Namely, an argument similar to the above shows that 

f(x) + i(H y f)(x) = lim 2f(x + ley), (37) 

ej.0 

hence 

(H y f)(x) = -i\im[f(x + iey) - f(x-iey)], (38) 

which, for s = 0, reduces to the previous relation with the ordi- 
nary Hilbert transform. As in the one-dimensional case, f(x) is the 
boundary-value of f(z) in the sense that 

f(x) = lim [f(x + iey) + f(x - iey)]. (39) 

For real-valued /, eqs. (38) and (39) reduce to 

f(x) = lim 2$tf(x + iey) 
(H y f)(x)= lim 2%f(x + iey). (40) 

Note: The Analytic-Signal transform is remarkable in that it com- 
bines in a single entity elements of the Hilbert, Fourier-Laplace and 
X-Ray transforms. In fact, it is an example of a much more gen- 
eral transform which, furthermore, includes the Radon transform and 
an n-dimensional version of the wavelet transform as special cases! 
(Kaiser [1990b,c]). 



5.3. Axiomatic Field Theory and Particle Phase Spaces 

We begin with a study of general fields, i.e. fields not necessar- 
ily satisfying any differential equation or governed by any particular 
model of interactions. The Analytic-Signal transform defines (at least 
formally) a canonical extension of such fields to complex spacetime. 
In this section we show that for general quantized fields satisfying 
the Wightman axioms (Streater and Wightman [1964]), the proposed 
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extension is natural and interesting. In particular, the double tube 
domain T = T + U 71_ (where y e V± in T±) can be interpreted as 
an extended classical phase space for certain "particle"- and "an- 
tiparticle" coherent states naturally associated with the quantized 
fields. Although the extended fields are in general not analytic (they 
need not even be functions, only ditributions) , they are "analyticity- 
friendly" in the sense that various objects associated with them are 
analytic functions. Consequently, they are more regular than the 
original fields, which are their boundary values. In particular, we will 
see that under some reasonable assumtions, the extended fields are 
operator-valued functions (rather than distributions) when restricted 
to T. The extensions of free fields and generalized free fields are, 
moreover, weakly holomorphic in T. 

Recall (chapter 4) that a system with a finite number of degrees of 
freedom, say a free classical particle, can be quantized in at least two 
ways: by replacing the position-and momentum variables with op- 
erators satisfying the canonical commutation relations (this is called 
canonical quantization), or by considering unitary irreducible repre- 
sentations of the underlying dynamical symmetry group (Q2 for non- 
relativistic particles and Vo for relativistic particles). The second 
scheme seems to be more natural, especially in the relativistic case, 
where position variables are not a covariant concept. But the first 
scheme has the advantage that interactions (potentials) can be intro- 
duced more easily, since it generates a kind of functional calculus for 
operators. 

Both schemes have counterparts in field quantization. In canoni- 
cal quantization, the particle position is replaced with the initial con- 
figuration of the field 0, i.e. with the values of <f>(x) at all space points 
x at some initial time x° = t; its momentum, according to Lagrangian 
field theory, then corresponds to the time-derivative of the field at 
time t. Thus the "phase space" of the field is simply the set of all 
possible initial data on the Cauchy surface xq = t in real spacetime. 
Quantization is implemented by requiring the initial field and its con- 
jugate momentum to satisfy an infinite-dimensional generalization of 
the canonical commutation relations. Although this is the standard 
approach to field quantization in the physics literature, its physical 
and mathematical soundness is open to question. Whereas the two 
schemes are equivalent when applied to non-relativistic quantum me- 
chanics, it is not clear that they remain equivalent when applied to 
field quantization, in general. However, they are equivalent for the 
quantization of free fields. We will apply canonical quantization to 
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the free Klein-Gordon and Dirac fields in sections 5.4 and 5.5. 

The second approach to field quantization is subsumed in the 
so-called axiomatic approach to quantum field theory. The unitary 
representations of Vo under which the fields transform are no longer ir- 
reducible. This is a consequence of the fact that while Vq is transitive 
on the phase space of a classical particle (i.e., any two locations, ori- 
entations and states of motion can be transformed into one another), 
it is no longer transitive on the phase space of a classical field (two 
sets of initial conditions need not be related by Vo). The reducibility 
of the representation corresponds to the presence of an infinite set of 
degrees of freedom or, at the particle level, to an indefinite number 
of particles. (Even two particles would result in reducibility, since 
the Lorentz norm of their total energy-momentum can be arbitrar- 
ily large.) Consequently, the representation is now characterized by 
an infinite number of parameters and no longer uniquely determines 
the theory, as it did (for a given mass and spin) when it was irre- 
ducible. On the other hand, the label space for the configuration 
observables, which for N particles in IR S was {1, 2, 3, ■ ■ ■ , sN}, is now 
IR S , and Lorentz invariance means that the "labels" x mix with the 
time variable. This gives an additional structure to quantized fields 
not shared by ordinary quantum mechanics, and some of this struc- 
ture is codified in terms of the Wightman axioms, partly making up 
for the indeterminacy due to the reducibility of the representation. 

For simplicity of notation we confine our attention to a single 
scalar field. The results of this section extend to an arbitrary system 
of scalar, spinor or tensor fields. Thus, let <p(x) be an arbitrary scalar 
quantized field. "Quantized" means that rather than being a real- 
or complex- valued function on spacetime (like the components of the 
classical electromagnetic field), (p(x) is an operator on some Hilbert 
space 7i. Actually, as noted by Bohr and Rosenfeld [1950], quantized 
fields are too singular to be measured at a single point in spacetime. 
This led Wightman to postulate that (j) is an operator-valued distribu- 
tion, i.e., when smeared over a test function f(x) it gives an operator 
<fi(f) (unbounded, in general) on H. The axiomatic approach turns 
out to provide a surprisingly rich mathematical framework common to 
all quantum field theories; therefore, it is model-independent. It was 
followed by constructive quantum field theory, in which model field 
theories are built and shown to satisfy the Wightman axioms. For 
simplicity, we state the axioms in terms of the formal expression <p(x) 
rather than its smeared form <p( f). Also, we assume, to begin with, 
that the field 4> is neutral, which means that the operators 4>(x) (or, 
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rather, their smeared forms 4>{f) with / real-valued) are essentially 
self-adjoint. Charged fields, which are described by non-Hermitian 
operators, and spinor (Dirac) fields, will be considered later. The 
axioms are: 

1. Relativistic Invariance: Poincare transformations are implemen- 
ted by a continuous unitary representation U of Vo acting on the 
Hilbert space Ti of the theory. (For particles of half-integral spin, 
such as electrons, Vo must be replaced by its universal cover; see 
section 5.5.) Thus, 

(j>(Ax + a) = U(a,A)(j>(x)U(a,A)*. (1) 

Let and M^ v be the self-adjoint generators of spacetime trans- 
lations and Lorentz trnsformations, respectively. They are in- 
terpreted physically as the total energy-momentum and angular 
momentum operators of the field — or, more generally, of the sys- 
tem of (possibly coupled) fields. They satisfy the commutation 
relations of the Lie algebra p of Vo- In particular, the P^'s must 
commute with one another, thus have a joint spectrum E C IR S+1 . 
(Actually, E is a subset of the dual (IR S+ ) of IR S+ , which we 
may identify with IR S+ using the Minkowski metric; see section 
1.1.) 

2. Vacuum: The Hilbert space contains a unit vector called the 
vacuum vector, which is invariant under the representation U, i.e. 
U (a, A)^o = for all (a, A) eVq. is unique up to a constant 
phase factor, and it is cyclic, meaning that the set of all vectors 
of the form 

0(/l)0(/ 2 ) ••>(/n)*0 (2) 

spans 7i. Invariance under Vo means that ^>o is a common eigen- 
vector of the generators P M and M^ u with eigenvalue zero: 

Pu^o = 

(3) 

M^o = 0. 

3. Spectral Condition: The joint spectrum E of the energy-momen- 
tum operators P M is contained in the closed forward light cone: 

E C V+. (4) 

This axiom follows from the physical requirement of stability, 
which merely states that the energy is bounded below. To see 
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this, note first of all that S must be invariant under Cq, by Ax- 
iom 1. Hence, if any physical state had a spectral component 
with p (jz V+ , that component could be made to have an arbi- 
trarily large negative energy by a Lorentz transformation. Note 
that the existence and uniqueness of the vacuum means that E 
contains the origin in its point spectrum with multiplicity one. 

4. Locality: The field operators <f>(x) and 4>(x') at points with space- 
like separation commute, i.e., 

[(j>(x),(j>(x')]=0 if (x - x') 2 < 0. (5) 

This corresponds to the physical requirement that measurements 
of the field at points with spacelike separations must be indepen- 
dent, since no signal can travel faster than light. (For fields of 
half-integral spin, such as the Dirac field treated in section 5.5, 
commutators must be replaced with anticommutators.) 

5. Asymptotic Condition: As the time xq — > ±oo, the field (f>(x) has 
weak asymptotic limits 

4>{x) — > 4>i a {x) (weakly) as xq — > — oo 

(6) 

4>{x) — > (j) ov x{x) (weakly) as x$ — > oo. 

0i n and ou t are free fields of mass m > 0, i.e. they satisfy the 
Klein-Gordon equation: 

(□ + m 2 ) in = (□ + m 2 ) out = 0. (7) 

Physically, this means that in the far past and future, all particles 
are sufficiently far apart to be decoupled, and that (f> interpolates 
the in- and out- fields. Furthermore, the Hilbert spaces on which 
0in,0out and (j> operate all coincide: 

Hin = ^out = 7~t- (8) 

In addition to the above, there are axioms concerning the regularity 
of the distributions 4>{x) (they are assumed to be tempered, i.e. the 
test functions belong to iS(IR' s+1 )) and the domains of the smeared 
operators </>(/), which we use implicitly, and clustering, which we will 
not use here. 

Let us now draw some conclusions from these axioms. Since the 
energy-momentum operators generate spacetime translations, it fol- 
lows from eq. (1) that 
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id^(x) = [ct>(x),P fl }. (9) 
For the Fourier transform this means that 

14>(p),p^=p^(p). (10) 

Consequently for any p G 1R S+1 , the "vector" 

$ p = 4>(-p) * (11) 

(which is in general non-normalizable) satisfies 

P^ p = 0(-p)]* o = PA- (12) 

That is, $ p is a generalized eigenvector of energy-momentum with 
eigenvalue p, unless it vanishes. The spectral condition therefore re- 
quires that 

0(-p)*o = O Vp^Ei, (13) 

where Si is the intersection of the spectrum E with the support of 
the distribution 0. More generally, if \l/p is any generalized eigen- 
vector of energy-momentum with eigenvalue p G E, then the above 
commutation relations show that 

P m ^')^ = (Pm-^)^ / )* p , (14) 

thus either </>(p') ^ p vanishes or it is a generalized eigenvector of 
energy-momentum with eigenvalue p — p' ', a necessary condition for 

which is that p — p' G E. Thus we conclude that the operator (j)(p), 
when it does not vanish, removes an energy-momentum p from the 
Geld. This establishes the physical significance of the field operators, 
as well as the connection between the (mathematical) Fourier vari- 
able p and the (physical) energy-momentum operators P^. From the 
Jacobi identity, it follows that 

[ [fa), 4>(p%p,] = (p„ + [Hp), 4>( p % (15) 

showing that [0(p),0(p')] (^ n °t zero ) removes a total energy-mo- 
mentum p + p' from the field. The cyclic property of the vacuum, 
furthermore, implies that the entire spectrum E is generated by re- 
peated applications of <j> to the vacuum. Note that in general we 
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cannot draw any conclusions about the support of 4>{p). For example, 
(f)(p) need not vanish for spacelike p, since E may contain points p' 
such that p' — p G S. In fact, very strong conclusions can be drawn 
from the nature of the support of (p. From Lorentz invariance and 
<j>{p)* = 4>(—p) it follows that (j)(p) = if and only if 4>{p') = 0, where 
p' = ±Ap for some A G Co- We conclude that the support of must 
be a union of sets of the form 





= {p | p 2 


= m 2 }, 


m > 




={p\p 2 


= 0, p ± 0} 




^00 


= {0} 








= {p\p 2 


= -m 2 }, 


m > 0, 



which are, in fact, the various orbits of the full Lorentz group C 
Greenberg [1962] has shown that <p is a generalized free field (i.e., a 

sum or integral of free fields of varying masses m > 0) if vanishes 
on any of the following types of sets: 

A = flim, m > 

S = O 00 U |J O m , M>0 

0<m<M (17) 

C= |J O m , M>0. 

m>M 

He has also shown, by giving counter-examples, that this conclusion 
cannot be drawn if vanishes on sets of the type 

D = |J fi m , < Mi < M 2 

Mi<m<M 2 (18) 
-E 1 = ^00- 

(See also Dell' Antonio [1961] and Robinson [1962].) 
Note: Up to now, we have not assumed that the field satisfies the 
canonical commutation relations (section 5.4), hence our conclusions 
are quite general and should hold for an arbitrary (system of mutu- 
ally) interacting field(s). The "Lie algebra" generated by the fields 
(obtained by including, along with the fields, their commutators 
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[4>(p),4>(p')] as well as higher-order commutators) has a very inter- 
esting formal structure, although it is not a Lie algebra in the usual 
sense. (For one thing, it is uncountably infinite-dimensional rather 
than finite-dimensional.) Namely, the above relations suggest that 
the operators P M be regarded as belonging to a Cartan subalgebra 

and that <p(p) (with p in the support of <p) is a root vector with as- 
sociated root —p. The spectrum E is therefore reminiscent of a set 
A + of positive roots. In general, the Cartan subalgebra consists of 
a maximal set of commuting observables. When considering charged 
fields, the charge will also belong to the Cartan subalgebra, with root 
values 0, ±£, ±2e, . . ., where £ is a fundamental unit of charge. The 
vacuum is a vector (not in the Lie algebra but in an associated rep- 
resentation space) of "highest" (or lowest) weight which, as in the 
finite-dimensional case, generates a representation of the algebra be- 
cause of its cyclic property. To my knowledge, this important analogy 
between the structures of general quantized fields (i.e., apart from the 
canonical commutation relations or any particular models of interac- 
tions) and Lie algebras has not been explored, although the methods 
of Lie-algebra theory could add a powerful new tool to the study of 
quantized fields. (In a somewhat different context, the structures of 
quantized fields and infinite-dimensional Lie algebras are united in 
string theory; see Green, Schwarz and Witten [1987].) # 

Let us now formally extend the quantized field <f>(x) to (E s+1 , us- 
ing the Analytic-Signal transform developed in section 5.2. Recall 
that this transform was originally defined for Schwartz test functions. 
In principle, we would like to define <f>(z) by using its distributional 
Fourier transform (f>(p): 

<l>{z) = (2n)- s ~ 1 [ dpe-^fo) 

Jm.^ 1 (19) 

= 4> (e~ izp ) . 

This presents us with a technical problem, as already noted in the last 
section, since 9~ lzp is not a Schwartz test function in p. One way out 
is to smear <f>(z) with a test function f(z) over (D s+1 . Although this is 
the safest solution, it is not very interesting since not much appears 
to have been gained by extending the field to complex spacetime: the 
new field is still an operator-valued distribution. However, we shall 
see that there are reasons to expect <p{z) to be more regular than a 
"generic" Analytic-Signal transform, due in part to the fact that 4>{x) 
satisfies the Wightman axioms. When is a (generalized) free field, 
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the restriction of <f>(z) to the double tube T turns out to be a holo- 
morphic operator-valued function. We will see that even for general 
Wightman fields, T is an important subset of (E s+1 . In the presence of 
interactions, holomorphy is lost but some regularity in T is expected 
to remain. We now proceed to find conditions which do not force <fi to 
be a generalized free field but still allow <p(z) to be an operator-valued 
function on T. The arguments given below have no pretense to rigor; 
they are only meant to serve as a possible framework for a more pre- 
cise analysis in the future. All statements and conditions concerning 
convergence, integr ability and decay of operator- valued expressions 
are meant to hold in the weak sense, i.e. for matrix elements between 
fixed vectors. Since the operators involved are unbounded, we must 
furthermore assume that the vectors used to form the matrix elements 
are in their (form) domains. 

For a fixed timelike "temper" vector y, 0~ lzv fails to be a 
Schwartz test function in two distinct ways: (a) It has a disconti- 
nuity on the spacelike hyperplane N y = {p \ yp = 0}, and (b) it has a 
constant modulus on hyperplanes parallel to N y , hence cannot decay 

there. On the other hand, by relativistic covariance, the support of </> 
must be smeared over the orbits of £, given by eq. (16). This gives a 
"stratification" of (p as a sum of tempered distributions 

4> = 4>+ + 4> + 4>oo + 4>- (20) 

with support properties 



supp 4> + C (J fi m = fi + 

m>0 
SUpp 00 Q ^0 
supp 000 Q ^00 
supp 4>- C [J O im = 

m>0 

Although 0+ and fi_ contain Qq, the distributions 0+ and 0_ 
have no contributions from p 2 = 0. (For example, the distribution 
1 + S(j>) on IR has a decomposition T\ + To where T\ has support 
IR and T has support {0} C IR, but the p = contribution to T\ 
vanishes.) Similarly, although fi contains O o, 4>o has no contribution 
from p = 0. Corresponding to the above decomposition of 0, we have, 
formally, 



(21) 
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<f>(z) = <f>+(z) + <f> (z) + (poo(z) + (j)-{z). 



(22) 



We will show that 

1. 4>+(z) and (f>o(z) are holomorphic operator-valued functions in T; 

2. 4>oo(z) must be a constant field, to be physically reasonable; and 

3. under certain (hopefully not too restrictive) conditions, 0_(z), 
though not holomorphic, is an operator-valued function on T. 

First of all, we claim that each of the fields <f> a , (a = ±,0,00) is 
still covariant under Vq* To see this, take the Fourier transform of 
eq. (1) and use the invariance of Lebesgue measure dp under Cq. This 
leads to 



where A' is the transpose of A G Co. Since the different components 
are "essentially" supported on disjoint subsets and these subsets are 
invariant under £ , we conclude that eq. (23) holds for each <j) a . 
To show that (f>+(z) and <j>o(z) are holomorphic in T + , let z e T + . 

Since 0~ iz p vanishes for p e F_\{0}, and since 4>+ and (fio have no 
contribution from p = 0, we have 



Now e~ tzp may be regarded as the restriction to V+ of a Schwartz 
test function f z (p) which is of compact support in p for each fixed po 
and vanishes when po < —E, for some E > 0. Thus <j)+(z) and 4>o(z) 
make sense as operator- valued functions on 7+, and they are cleary 
holomorphic there. (This will be shown explicitly below.) A similar 
analysis shows that the same can be done for z E 71 . 

Next, we consider (j>oo(z). Since 0oo is supported on {p = 0}, 
it must have the form P(d)8(p), where P(d) is a partial differential 
operator. In the x-domain (i.e., in real spacetime), this corresponds to 
a polynomial P(—ix), for which the Analytic-Signal transform is not 
well-defined, although it is possible that a regularization procedure 
would cure this. But in any case, non-constant polynomials in x 
do not appear to be of physical interest since they correspond to 



* However, <p a need not satisfy other Wightman axioms such as 
locality; for example, 0+ need not be a generalized free field. 



(j)(p) = e ia P U(a, A) <f>{Mp) U(a, A)*, 



(23) 




(24) 
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unbounded fields even in the classical sense (as functions of x). Hence 
we assume that 

(a) 4>oo(p) = 2A5(p), where A is a constant operator. 

This corresponds to a constant field (j>(x) = 2A and, correspondingly, 
0oo (z) = A (see eq. (15) in section 5.2). In order that <f>(z) be an 
operator-valued function on T, it therefore remains only for <p- (z) to 
be one. Note that so far, the only assumption we needed to make, in 
addition to the Wightman axioms, was (a). To make </>_ (z) a function, 
we now make our second assumption: 

(b) <j)-(p) is integrable on all spacelike hyperplanes. Furthermore, the 
integral of <j> over the hyperplane H y v = {p\yp = v} (y G V) 
grows at most polynomially in v. 

It is not clear what specific minimal conditions on 0_ produce this 
property. The integral occurring in (b) is known as the Radon trans- 
form (R(p)(y, v) of (j) when y is a (Euclidean) unit vector, and will be 
further discussed in section 6.2. (See also Helgason [1984].) Unlike the 
Fourier transform, the Radon transform does not readily generalize 
to tempered distributions (which were, after all, designed specifically 
for the Fourier transform). However, it does extend to distributions 
of compact support and can be further generalized to distributions 
with only mild decay. Also, the relation of assumptions (a), (b) (or 
their future replacements, if any) to the Wightman axioms needs to 
be investigated. 

In order to compute <f>-(z) for z G T it suffices, by covariance, to 
do so for x = and y = (u, 0), for all u ^ 0. The analyses for u > 
and u < are similar, so we restrict ourselves to u > 0. Eq. (19) then 
gives 

<P-(-iu,0) = (2n)- s - 1 [ dp Q e- upo [ d s p 0_(p o ,p). (25) 

JO J TR S 

For fixed po > 0, condition (b) implies that the integral over p con- 
verges, giving an operator-valued function F(j>q) which is of at most 
polynomial growth in p . (f)_(—iu,0) is then the Laplace transform 
of F(po), which is indeed well-defined. 

Note: The behaviors of <f>(z) and <f>(p) exhibit a certain duality which 
reflects the dual nature of y G IR S+1 and p G (IR S+1 )* (section 1.1). 
We have just seen that when <p(p) behaves reasonably for spacelike p, 
then (j)(z) behaves reasonably for timelike y. In the trivial case when 
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cj)(p) = for p 2 < and (a) holds, <j)(z) is holomorphic for y 2 > 0. 
In fact, <p is then a generalized free field, hence may be said to be 
"trivial." This dual behavior also extends to p 2 > and y 2 < 0: For 
any non-constant field, 4>{p) is non-trivial for p 2 > 0; for spacelike 
y, the hyperplane H y ^ (which contains timelike as well as spacelike 

directions) therefore intersects the support of (p in a non-compact 
set and we do not expect 4>{z) to make sense as an operator-valued 
function outside of T. # 

No claims of analyticity can be made for (f>(z) in general. In fact, 
Greenberg's results show that <p{z) may not be analytic anywhere in 
G s+1 unless is a generalized free field. For as in the classical case, 
formal differentiation with respect to gives 

2td fI <P(z) = (2n)- s - 1 [ dp Pfl 5(yp)e-™ p 0(p), (26) 

hence 4> is the inverse Fourier transform of p^cj) in the hyperplane 

N y . If (j) is not a generalized free field, then, according to Greenberg, 

the support of <p contains sets of timelike as well as sets of spacelike p's 
with positive Lebesgue measure. Hence, for any nonzero y G IR S+1 , 
the intersection of the support of <j> with N y has positive measure in 
N y , so 4> will not be holomorphic at x — iy in general. As in the 
classical case, however, the above equation for <9 M </> implies that (p(z) 
is holomorphic along the vector field y, i.e., 

y^cj> = 0. (27) 

This is a covariant condition which, when specialized to y = (yo,0), 
states that 4>(z) is holomorphic in the complex time-direction. As we 
have seen, this result simply follows from the nature of the Analytic- 
Signal transform. A similar situation forms the basis of Euclidean 
quantum Geld theory. However, there one is dealing not directly with 
the field but with its vacuum expectation values, and the mathemat- 
ical reason for the analyticity is the spectral condition, which would 
appear to have little in common with the Analytic-Signal transform. 

Incidentally, eq. (26) provides a simple formal proof that 4>+{z) is 
holomorphic in T. The support of <fi+(p) is contained in V, hence for 
any y in V', its intersection with N y is either empty or equal to Qoo- 
But the contribution from p = vanishes, hence eq. (26) shows that 
d^(/)+(z) = in T. The same argument also shows that free fields and 
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generalized free fields are holomorphic in T. In the next two sections 
we shall study free Klein-Gordon and Dirac fields in more detail. 

Although <p{z) is not holomorphic in general, we will be able to 
establish for it one essential ingredient of the foregoing phase-space 
formalism, namely the interpretation of the double tube T as an ex- 
tended classical phase space for certain "particles" and "antiparticles" 
associated with the quantized field 4>. 

First, let us expand the above considerations to include charged 
fields by allowing (j)(x) to be non-Hermitian (i.e., a non-Hermitian 
operator-valued distribution). Then the extended field <p{z) need not 
satisfy the reflection condition (f>(z)* = <j>(z). The charge Q is defined 
as a self-adjoint operator which generates overall phase translations 
of the field, i.e., 

e~ iaQ <f>{x) e iaQ = e iae <j>(x) (28) 
for real a, where £ is a fundamental unit of charge. This implies 

[(f>(x),Q]=e(f>(x) 
[4>(p),Q]=e4>(p), 

showing that <f>(x) and 0(p) remove a unit e of charge from the field, 
while their adjoints add a unit of charge. We assume that phase 
translations commute with Poincare transformations, and in particu- 
lar with spacetime translations. Thus 

[Q,^]=0, (30) 

so charge is conserved. Q can be included in the "Cartan subalgebra" 
containing the P M 's, and the above commutation relations show that 

(j){p) and 4>{p)* are still "root vectors," with Q-root values — e and e, 
respectively. We also assume that the vacuum is neutral, i.e. Q^o = 
0. Repeated applications of <p and <p* to show that the spectrum 
of Q is {0,±£,±2e, ...}. 

Recall that the commutation relation between <j)(p) and _P M im- 
plied that 4>(p) removes an energy-mometum p from the field. Simi- 
larly, its adjoint relation 



(31) 
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shows that 4>(p)* adds an energy-momentum p to the field. In place 
of the generalized eigenvectors $ p of energy-momentum which we had 
for the Hermitian field, we can now define two eigenvectors, 

$ p =0(-p)* o , 

for each p e Si. For a non-Hermitian field, these vectors are inde- 
pendent. They are states of charge e and — e, respectively. We may 
think of them as particles and antiparticles, although they do not 
have a well-defined mass since p 2 will be variable on Si, unless is 
a free field. Each p ^ in Si belongs to the continuous spectrum of 
the -P M 's, since it can be changed continuously by Lorentz transforma- 
tions. Hence the "vectors" $^ are non-normalizable. Since the P^s 
are self-adjoint, and since and $~ belong to different eigenvalues 
of the charge operator (which is also self-adjoint), we have (with the 
usual abuse of Dirac notation, where "inner products" of distributions 
are taken) 

($+|$-) = 

(33) 

(^\^) = a(p 2 )(2n) s+1 S(p- q ), 

where a, a distribution with support in Si, depends only on p 2 by 
Lorentz invariance. (Charge symmetry requires that a be the same 
for particles as for antiparticles.) If (p is the free field of mass m > 0, 
a(p 2 ) = e(p ) 27i5(p 2 - m 2 ) = 2n5(p 2 - m 2 ) in F + \{0}. 
Now define the particle coherent states by 

e+ = 0(2:)*vl/ o 

= (2tt)- s - 1 / dpe^fayvo m , 
= (2tt)- s - 1 / dp e i ~ zp §+. 

Jv+ 

Like the 's, these do not have a well-defined mass; in addition, 
they are wave packets, i.e. have a smeared energy-momentum, but 
they still have a definite charge e . Their spectral components are 
given by 

( $+ | e+ ) = a{p 2 ) 6^ = a(p 2 )9(yp) e* 2 X (35) 
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If z belongs to the backward tube 71, then yp < on F + \{0}, hence 
e+ =0. If z belongs to the forward tube T + , then yp > and the 
vector e+ is weakly holomorphic in z. For the free field, it reduces to 
the coherent state e z defined in chapter 4. 

Similarly, define the coherent antiparticle states by 

= (2tt)- s - 1 f dp9- lzp 4>(p)^ 

= (2tt)- s - 1 / dp d lzv §~. 
Jv+ 

These are wave packets of charge — e for which 

( % I e~ > = a(p 2 ) = a(p 2 ) 9(-yp) e*". (37) 

Thus e~ vanishes in the forward tube and is weakly holomorphic in 
the backward tube. 

In the usual formulation of quantum field theory, particles are 
associated not directly with the interacting, or interpolating, field <f> 
but with its asymptotic fields 0i n and (f) ou t, which are free. (We will 
construct such free-particle coherent states in the next two sections.) 
However, the coherent states ef are directly associated with the 
interpolating field. We shall refer to them as interpolating particle 
coherent states (section 5.6). 

We are now ready to establish the phase-space interpretation of 
T in the general case. We will show that T + and 71 are extended 
phase spaces associated with the particle- and antiparticle coherent 
states e+ and e~ , respectively, in the sense that they parametrize the 
classical states of these particles. 

We first discuss i as a "position" coordinate. In the case of in- 
teracting fields there is no hope of finding even a "bad" version of 
position operators. Recall that position operators were in trouble 
even in the case of a one-particle theory without interactions! In the 
general case of interacting fields, this problem becomes even more se- 
rious, since one is dealing with an indefinite number of particles which 
may be dynamically created and destroyed. (As argued in section 4.2, 
the generators Mok of Lorentz boosts qualify as a natural, albeit non- 
commutative, set of center-of-mass operators; although I believe this 
idea has merit, it will not be discussed here.) Since no position oper- 
ators are expected to exist, we must not think of x as eigenvalues or 



(36) 
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even expectation values of anything, but rather simply as spacetime 
parameters or labels. 

On the other hand, y will now be shown to be related to the 
expectations of the energy-momentum operators (which do survive 
the transition to quantum field theory, as we have seen). For z,z' G 
7+, we have 

<e+|e+>H*o|0(*V(*)**o> 

= (2tt)- s - 1 / dp e~^ z ' ~~ z)v a{p 2 ) 
Jv+ 

= —[ dm 2 aim 2 ) I dp e~^ z ' ~ z)v 
1 f°° 

= / dm 2 aim 2 ) A + (z' — z: m), 

where we have set m 2 = p 2 and used 

(27T)- 5 - 1 dp = (2n)- s - 1 dp d s p = (27T)- 1 dm 2 dp, (39) 

with 

dp= [2(2tt) s y / m 2 + p 2 ] 1 d s p (40) 

the Lorentz-invariant measure on A + (w;m) is the two-point 

function for the free Klein-Gordon field of mass m, analytically con- 
tinued to w = z' — z G T + . In the limit y, y' — > 0, this gives the 
Kallen-Lehmann representation (Itzykson and Zuber [1980]) for the 
usual two-point function, 



1 f°° 

(* \4>(x')(f)(xy^o) = / dm 2 a(m 2 ) A+(x' - x;m), (41) 

2m J 

which is a distribution. In Wightman field theory, such vacuum expec- 
tation values are analytically continued using the spectral condition, 
and conclusions are drawn from these analytic functions about the 
field in reai spacetime. In our case, we have first extended the field 
(albeit non-analytically) , then taken its vacuum expectation values 
(which, due to the spectral condition, are seen to be analytic func- 
tions, not mere distributions). The fact that we arrived at the same 
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result (i.e., that "the diagram commutes") indicates that our approach 
is not unrelated to Wightman's. However, there is a fundamental dif- 
ference: The thesis underlying our work is that the "real" physics 
actually takes place in complex spacetime, and that there is no need 
to work with the singular limits y — > 0. 
The norm of e+ is given by 



|e+|| 2 = (2tt)- s - 1 / dpa(p 2 )e- 2yp 



1 

2^ 



'v+ 

dm 2 a(m 2 ) G(y; m), 



(42) 



where G(y;m), computed in section 4.4, is given by 

/ in \ ^ 

G(y;m) = (27r)- 1 (— ) K v (2Xm). (43) 

Recall that A = a/?/ 2 , v = (s — l)/2 and K v is a modified Bessel 
function. We assume that e+ is normalizable, which means that the 
spectral density function cr(m 2 ) satisfies the regularity condition 



le+ll 2 

l c z II 



= (2tt)- 2 / dm 2 a(m 2 ) ( ^-X K v {2\m) 
Jo \47rA/ 

= F{\) < 00. 



(44) 



(This condition is automatically satisfied for Wightman fields, where 
it follows from the assumption that is a tempered distribution; how- 
ever, it is also satisfied by more singular fields since K v decays expo- 
nentially.) It follows that 

(e+|P M e+) = (2ir)— 1 / dpa(p 2 )p,e- 2 y? 

Jv+ 

_1 dF(X) (45) 
~~ 2 dyv 
= -y lx F , (X)/2X. 

Using the recursion relation (Abramowitz and Stegun [1964]) 

-— (\~ v K u {2\m)) = 2m\- u K v+1 {2\m), (46) 

OA 

we find that the state e+ has an expected energy-momentum 
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(47) 



where 



m A = 



2F(A) 



/ °° dm 2 a(m 2 )m u+1 K v+1 {2\m) 
J dm 2 a(m 2 ) m v K u {2\m) 



(48) 



We call m\ the effective mass of the particle coherent states; it gener- 
alizes the corresponding quantity for Klein-Gordon particles (section 
4.4). The name derives from the relation 



It is important to keep in mind that in quantum field theory, the 
natural picture is the Heisenberg picture, where operators evolve in 
spacetime and states are fixed. Recall that for a free Klein-Gordon 
particle (chapter 4), we interpreted e z as a wave packet focused about 
the event x = $lz and moving with an expected energy-momentum 
(m\/\) y. This suggests that the above states ef be given a similar 
interpretation. Thus z becomes simply a set of labels parametrizing 
the classical states of the particles. 

This establishes the interpretation of T + as an extended classical 
phase space associated with the "particle" states e+. A similar com- 
putation shows that 71 acts as an extended classical phase space for 
the "antiparticle" states e~, whose expected energy-momentum is 



The expected angular momentum in the states ef can be com- 
puted similarly. The angular momentum operator M^ v is the gener- 
ator of rotations in the \i-v plane, hence 



(P) 2 = (P,}(P») 




(49) 





(50) 




(51) 



This implies for the Fourier transform 




(52) 
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Since the vacuum is Lorentz-invariant, we have M^^q = 0, hence 
for z e T + , 



M^et = (2tt)- s - 1 [_ dpe^iM^iip)*}^ 
Jv+ 

= W'" 1 J v+ dP **» (p^-P^) *(P)**0 (53) 

= (2tt)- s - 1 / dp (z li p v -z vPlt )e iZp Q+, 
Jv+ 

provided that 0(p) vanishes on the boundary of V + (this excludes 
massless fields). The expectation of M^ v is therefore related to that 
ofP^by 

(ilV) =z„{P v )-z v {P li ) 

= x li {P v ) -x u {P fl ) ( 5 4) 

Similarly, in e~ with zGT_, 

( ) = — i^fxVv — XvVn)- ( 55 ) 

This section can be summarized by saying that the vector y plays 
a similar role for general quantized fields as it did for positive-energy 
solutions of the Klein-Gordon equation, namely it acts as a con- 
trol vector for the energy-momentum. In other words, the function 
9{yp) e~ yp acts as a window in momentum space, filtering out from 
each mass shell fi m momenta which are not approximately parallel 
to y. The step function 9(yp) makes certain that only parallel com- 
ponents of p pass through this filter by eliminating the antiparallel 
ones (which would make the integrals diverge). Thus we may think 
of 6(yp) e~ yp as a kind of "ray filter" in V, when y e V . We continue 
to refer to y as a temper vector (section 4.4). 

Note: The regularity condition given by eq. (44) for cr(m 2 ), i.e. the 
requirement that e+ be normalizable, shows that A acts as an effective 
ultraviolet cutoff, since K v {2\m) decays exponentially as m — > oo, 
giving finite values to m\ and other quantities associated with the 
field. # 
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5.4. Free Klein— Gordon Fields 

In the context of general quantum field theory, we were able to show 
that T plays the role of an extended phase space for certain "parti- 
cle" states of the fields. The question arises whether the phase-space 
formalism of chapter 4 can be generalized to quantized fields. There, 
we saw that all free-particle states in the Hilbert space could be re- 
constructed from the values of their wave function on any phase space 
a C T + . There are two possible ways in which this result might ex- 
tend to quantized fields: (a) The vectors ef belong the subspaces H±i 
with charge ±e, hence we may try to get continuous resolutions, not 
of the identity on Tt but of the orthogonal projection operators TI-i-i 
to H±i, in terms of these vectors. (This can then be generalized to 
the resolution of the projection operator II n to the subspace 7i n with 
charge ne, n e 7Z.) (b) The global observables of the theory, such 
as the energy-momentum, the angular momentum and the charge 
operators, are usually expressed as conserved integrals of the field op- 
erators and their derivatives over an arbitrary configuration space S in 
spacetime (i.e., an s-dimensional spacelike submanifold of IR S+1 ); our 
approach would be to express them as integrals of the extended fields 
over 2s-dimensional phase spaces a in T, much as the inner products 
of positive-energy solutions were expressed as such integrals. In this 
section we do both of these things for the free Klein-Gordon field of 
mass m > 0, which is a quantized solution of 

(□ + m 2 ) </>(x) = 0. (1) 

We consider classical solutions at first. The Fourier transform <p{p) 
has the form 

4>(p) = 2n5(p 2 - m 2 ) a(p) (2) 

for some complex- valued function a(p) defined on the two-sheeted 
mass hyperboloid fi m = 0+ U fi~ . Write 



b(p) = a(-p), pen+. (3) 

If the field is neutral, then <f>(x) is real-valued and b(p) = a(p). For 
charged fields, a(p) and b(p) are independent. At this point, we keep 
both options open. Then 
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<j>{x) = (2tt)- s / dp 5(p 2 -m 2 ). 
= I dpe- lxp a(p) 

J f2™ 



-ixp 



a{p) 



(4) 



dp e~ lxp a(p) + e lxp b(p) 



The extension of (j>(x) to complex spacetime given by the Analytic- 
Signal transform is 



4>{x - iy) 



' _dr_ 
2ivi ./-oo t - i 



- 4>{x - ry) 



(2tt)- s - 1 / dp6- lzp (P(p) 



= I dp9- lzp a{p) 

= / « 



(5) 



9{yp) e~ lzp a(p) + 9{-yp) e lzp b(p) 



If y G V+, then yp > for all p G 0+ , hence 

<P(z) = [ dpe- lzp a(p) 



(6) 



is analytic in T + , containing only the positive-frequency part of the 
field. Similarly, when y G V!_, then yp < for all p G 0+ and 



4>(z) 



dpe tzp b(p) 7 zeT- 



(7) 



Thus 0(z) is also analytic in 71, where it contains only the negative- 
frequency part of the field. However, note that the two domains of 
analyticity 7+ and 71 do not intersect, hence the corresponding re- 
strictions of (j)(z) need not be analytic continuations of one another. 

We are now ready to quantize <p(z). This will be done by first 
quantizing (j)(x) and then using the Analytic-Signal transform to ex- 
tend it to complex spacetime. We assume, to begin with, that is a 
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neutral field, so b(p) = a(p). According to the standard rules (Itzyk- 
son and Zuber [1980]) of field quantization, <p{x) becomes an operator 
on a Hilbert space TC such that at any fixed time xq, the field "configu- 
ration" operators 0(xo,x) and their conjugate "momenta" do4>(xo,x) 
obey the equal-time commutation relations 

[(f)(x),(f)(x')} x ' o=xo = 
[cj)(x),d (j)(x')} x ' o=Xo =i8(x-x.'). 

This is an extension to infinite degrees of freedom of the canonical 
commutation relations obeyed by the quantum-mechanical position- 
and momentum operators. Note that since time evolution is to be 
implemented by a unitary operator, the same commutation relations 
will then hold at any other time as well. For the non-Hermitian 
operators a(p), the corresponding commutation relations are 

[a(p), a(p')} = 2p e(-pop' ) (2tt) s <5(p + p') (9) 

for p,p' G fi m . Using the neutrality condition a(— p) = a(p)*, these 
can be rewritten in their conventional form 

[a(p),a(p')} = 
[a(p),a(pr] = 2iv(2ny5(p-p') 

where now p,p' e 0+. 

A charged field can be built up from a pair of neutral fields as 

<p(x) = M*) + *Mx) : (n) 

v 2 

where the two fields 4>i,4>2 each obey the equal-time commutation 
relations and commute with one another. Equivalently, the operators 
a(p) and b(p) become independent and satisfy 

[a(p),a(p')] = [b(p),b(p')] = [a(p),b(p')} = 0, 
la(p),a*(p')} = [b(p),b*(p')] = 2^(27r) s 5(p-p') 

for p,p' e fi+ . The canonical commutation relations for both neutral 
and charged fields can be put in the manifestly covariant form 
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[<t>(p),4>(p')*] = 4tt 2 5(p 2 - m 2 ) 8(p' - m 2 ) [a(p), a(p')*] 

= (27r) s+2 2po Wo) <V - m 2 ) 5(p' 2 - p 2 ) S(p - p') 
= (27r) s+2 2po Wo) <V - m 2 ) 5(^ 2 - p 2 ) 5(p - p') 
= sign(po) (2tt) s + 2 <% 2 - m 2 ) <5(p - p') 

(13) 

for arbitrary p,p' G IR S+ . For charged fields, this must be supple- 
mented by 

[4>(p),4>(p')]=o, P ,p'eM s+1 . (14) 

Recall that in the general case we had 

(<5>;\<5>+)=a(p 2 )(2ny +1 5(p-p') (15) 

for p, p' G V+, where cr(p 2 ) is the spectral density for the two-point 
function (sec. 5.3, eq. (33)). For the free field now under considera- 
tion we have 

($+i$+) = (* o i0(p)0(p , r*o) 

= <*o|[#(p),#(pT]*o> (16) 
= (2tv) s+2 5(p 2 -m 2 )5(p-p'), 

which shows that the spectral density for the free field is 

a(p 2 ) = 2n5(p 2 - m 2 ). (17) 
The spectral condition implies that 

a(p)* = 0, b(p)^ = VpGO+, (18) 

since otherwise these would be states of energy-momentum —p. 
Hence the particle coherent states defined in the last section are now 
given by 



= [ dpe***+ zeT+, 



(19) 
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where the vectors are generalized eigenvectors of energy-momen- 
tum p G 0+ with the normalization 

<$;i$+)H*o|a(p)a(|/)**o> 

= (*o|[a(p),a(p , )*]* > (20) 
= 2u(2n) s <5(p-p'). 

The wave packets e+ span the one-particle subspace 7i\ of 7i and 
have the momentum representation 

(*+|c+)=e**. (21) 
A dense subspace of 7Yi is obtained by applying the smeared operators 

<p*(f) = 4>{fy = [ dx<p(xy f(x) = (27T)- 5 - 1 / dp fay f{p) 

(22) 

to the vacuum, where / is a test function. This gives 



9+ = <p(f)9 = (2ic)-'- 1 / dp f(p)4>*(p)Vo 
= (2tt)— 1 / dp f(p) d>+ = / dp f(p) *+ 



(23) 



where / is the restriction of / to 0+ . Hence the functions 

(e+|*+)= f dpe- iz *f(p) (24) 

are exactly the holomorphic positive-energy solutions of the Klein- 
Gordon equation discussed in section 4.4, with e+ corresponding to 
the evaluation maps e z . The space K, of these solutions can thus be 
identified with 7ii, and the orthogonal projection from TC to Hi is 
given by 

(n 1 *)(z) = (e+|*>. (25) 

Consequently, the resolution of unity developed in chapter 4 can now 
be restated as a resolution of III : 
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n 1= f da |e+)(e+|, (26) 

J (7-|- 

where a+, earlier denoted by cr, is a particle phase space, i.e. has the 
form 

a+ = {x-iy\x e S, yen+} (27) 

for some A > and some spacelike or, more generally, nowhere time- 
like (see section 4.5) submanifold S of real spacetime. As in section 
4.5, the measure da is given in terms of the Poincare-invariant sym- 
plectic form a = dy^ A dx^ by restricting a s = a A ■ • • A a to a+ and 
choosing an orientation: 

da = (slAx)' 1 ^ =A- 1 dy^ A^. (28) 

Similarly, the antiparticle coherent states for the free field are 
given by 



0(2) #0= / dpe iz Pb(p)*y 



dpe lzp %, zeT_. 



/ t ,- 

•/fit 



(29) 



Since for p e 0+ and z 6 T_ we have 

(%-|e 2 -) = e^ = ($;|e+), (30) 

it follows that e~ has exactly the same spacetime behavior as ej, 
confirming the interpretation of an antiparticle as a particle moving 
backward in time. An antiparticle phase space is defined as a sub- 
manifold of T- given by 

cr_ = {x - iy | x E S, ye O^}, (31) 
where S is as above. The resolution of n_i is then given by 

n-i= / da\e-)(e-\. (32) 

J cr_ 

Many-particle or -antiparticle coherent states and their corre- 
sponding phase spaces can be defined similarly, and the commutation 
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relations imply that such states are symmetric with respect to per- 
mutations of the particles' complex coordinates. For example, 

e+ 22 ^i)>(^)**o = e+ 22l , 

since <j>(zi) and §{z-i) commute. In this way, a phase-space formal- 
ism can be buit for an indefinite number of particles (or charges), 
analogous to the grand-canonical ensemble in classical statistical me- 
chanics. This idea will not be further pursued here. Instead, we 
now embark on option (b) above, i.e. the construction of global, 
conserved field observables as integrals over particle and antiparticle 
phase spaces. 

The particle number and antiparticle number operators are given 

by 



N+= [ dp a(p)* a{p) 
N_= [ dpb(p)*b(p), 



(33) 



N + and iV_ generalize the harmonic-oscillator Hamiltonian A* A to 
the infinite number of degrees of freedom possessed by the field, where 
normal modes of vibration are labeled by p G 0+ for particles and p e 
fi~ for antiparticles. The total charge operator is Q = e (N + — N_), 
as can be seen from its commutation relations with a(p) and b(p). But 
the resolution of unity derived in chapter 4 can now be restated as 

/ da exp(izp - izp') = (2tt) s 2w(p) S(p - p') = ( $J | $+ ) 

7 (34) 

/ da exp(izp - izp) = (2tt) s 2w(p) 6(p - p') = ( $ p | ® p , ) 

J <T_ 

for p,p' e f2+, where the second identity follows from the first by re- 
placing z with z and a+ with a_ . It follows that N± can be expressed 
as phase-space integrals of the extended field <f>(z): 



N+= I da <f>(z)* <f>(z) 

J (7+ 

N_= I da<f>(z)<f>(z)*. 
J a- 



(35) 
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Hence the charge is given by 

Q = e I da <j>{z)* <j>{z) - e f da </>(z) (j){z)* . (36) 

J (7-|- J <T_ 

The two integrals can be combined into one as follows: Define the 
total phase space as a = cr+ — a_ , where the minus sign means that 
cx_ enters with the opposite ("negative") orientation to that of cr + , 
in the sense of chains (Warner [1971]). The reason for this choice of 
orientation is that and are both open sets of IR S+1 , hence 
must have the same orientation, and we orient and fi^ so that 

Q± = -dB±. (37) 

Since the outward normal of B^ points "down" and that of B^ points 
"up," this means that must have the opposite orientation to that 
of 0^. Thus, setting B\ = B^ + B^ and Q\ = — O^", we have 

A = -dB x . (38) 
This gives a_ the orientation opposite to that of cr + , and we have 

a = Sxn x = a + -a-. (39) 
Next, define the Wick-ordered product (or normal product) by 

'.tr.. w 

This coincides with the usual definition, since in T + , 4>* is a creation 
operator and is an annihilation operator, while in 71 these roles are 
reversed. The charge can now be written in the compact form 

Q = e I da : (j){z)* <j>{z) : (41) 
J a 

We may therefore interpret the operator 

p(z)=e: <f>(z)* <f>(z) : (42) 

as a scalar phase-space charge density with respect to the measure 
da. 
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The Wick ordering can be viewed as a special case of imaginary- 
time ordering, if we define <j>*(z) = (f>(z)*: 

: cj>{z'y <j>{z) :=: <t>*{z') <j>{z) := T, [cj>*{z') cj>{z)\ , (43) 

where 

= 9 (3(4 - zq)) <P*(z') Hz) + 9 ($S(zo - 4)) 4>(z) V) 
for z, z' G T. This definition is Lorentz-invariant, since 

TiW)<l>{z)] = <j>{z')cj>{z) = cj>{z)cj>{z') \Jz,z'ET (45) 

and 

T i [r(z') ( f>(z)} = r(z')^) = ^)r(z') m 

when z and z' are in the same half of T, whereas if they are in opposite 
halves of T, the sign of Q(z' — zq) is invariant. 

Note: For the extended fields, the Wick ordering is not a necessity 
but a mere convenience, allowing us to combine the integrals over 
cr_|_ and <r_ into a single integral. Each of these integrals is already 
in normal order, since the extension to complex spacetime polarizes 
the free field into its positive-and negative-frequency parts. Also, 
the extended fields are operator-valued functions rather than distri- 
butions, hence products such as (j>(z)* (f)(z) are well-defined, which is 
not the case in the usual formalism. A similar situation will occur in 
the expressions for the other observables (energy-momentum, angular 
momentum, etc.) as phase-space integrals. Hence the phase-space 
formalism resolves the problem of zero-point energies without the 
need to subtract infinite terms "by hand" ! In this connection, see the 
remarks on p. 21 of Henley and Thirring [1962]. # 

The above expression for the charge can be related to the usual 
one in the spacetime formalism, which is 



n ■ f T a* ^ W a< \ 
Qusual = is dx^ : — — • (p{x) 

= [ S M J^(x), 
Js 



(47) 



'S 

by using Q\ = —dB\ and applying Stokes' theorem: 
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Q = eA~ 1 [ dx^ [ dy^ : 0*0 

_d 



= —e A x 1 f dx^ [ dy : (p* 4> : 



J s Jb x 



iy), 



where 



is the phase-space current density. Using the notation 

d ltd . d 

= -7T— = - -7: h l- 



dz^ 2 \dx^ dy^ 
we have 

Hi)' 1 <)"). 



dy^ 

Hence, by the holomorphy of <f>, 



dy^ 

= is (d" - 0") : <j)* <j) : 
= is : (p* d^(j) - 8»(P* ■ (j) : 
d(j) 8(f)* 



— is : 



dx^ dx^ 
Our expression for the charge is therefore 

Q= dx^ J ( M A) (x), 
J s 

where 

J» X) ( X )=A- 1 I dyf{x-iy) 
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is seen to be a regularized version of the usual current density J M (x) 
obtained by first extending it to T and then integrating it over B\. 

The vector field j^(z) is conserved in reaJ spacetime for each fixed 
y e V, since 



0: 

: (j)* 6 : 

(55) 



= is (d„ + <9 M ) (0" - 0") : 
= is (D z - U z ) :(/>*(/> : 
= : ((f)* n z( f> - U- z <$>* ■ <(>) 

= o, 



by virtue of the Klein-Gordon equation combined with the holomor- 
phy of (p in T. This implies that (x) is also conserved, hence the 

charge does not depend on the choice of S or a. 

Note: In using Stokes' theorem above, we have assumed that the 
contribution from \yo\ — > oo vanishes. (This was implicit in writing 
the non-compact manifold Q\ as —dB\.) This is indeed the case, as 
has been shown rigorously in the context of the one-particle theory 
in chapter 4 (theorem 4.10). Also, we see another example of the 
pattern, mentioned before, that in the phase-space formalism vector- 
and tensor fields can often be derived from scalar potentials. Here, 
p(z) acts as a potential for j^(z). Note also that the Klein-Gordon 
equation can be written in the form 



(□ 2 + m 2 )p{z) =e:<j)*{n z + m 2 )(p := 0, (56) 

which is manifestly gauge-invariant. # 

Recall now that <f>(z) is a "root vector" of the charge with root 
value — e: 

[(j){z'), Q] = e <f>(z') Vz' e (D s+1 . (57) 



Substituting our expression for Q, we obtain the identity 



5.4. Free Klein-Gordon Fields 



191 



4>{z')= [ da{z)W)s m* <!>{*) '\ 

J a 

= I da(z)[<p(z'),<P(zy]<p(z) (58) 

J a 

= I d*(z)K(z',z)<t>(z), 

J a 

where, by the canonical commutation relations, 

K(z',z)^[cf>(z'), ( f>(zr] = [ ( f>(z% ( f>*(z)} 

= (27r)- 2s - 2 | dp' J dpe-* z 'v' e^WUipf] 

= (2tt)- s J dp' J dp9- lz ' p ' 6 lzp sign(p ) 5(p 2 -m 2 )5(p' - p) 
dp sign(po) 0(yp) 9(y'p) exp[~i(z' - z)p). 



(59) 

K is a distribution on (E s+1 x (E s+1 which is piecewise analytic in 
T x T, with 



-i 



iA + (z' — z;m), z',z<ET + 



0, z'eT_,zeT + . 

The two-point functions — iA + and iA~ are analytic in T + and T_, 
respectively, and act as reproducing kernels for the subspaces with 
charge e and —e. Because of the above property, it is reasonable 
to call K(z',z) a reproducing kernel for the field 4>(z), though this 
differs somewhat from the standard usage of the term as applied to 
Hilbert spaces (see chapter 1). Note that K propagates positive- 
frequency components of the field into the forward ("future") tube 
and negative-frequency components into the backward ( "past" ) tube. 
This is somewhat reminiscent of the Feynman propagator, but K is 
a solution of the homogeneous Klein-Gordon equation in the real 
spacetime variables rather than a Green function. 

The energy-momentum and angular momentum operators may be 
likewise expressed as conserved phase-space integrals of the extended 
field: 
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P M = % I da : 0*<9 M <; 



= i da : (j)*(x^d u - x v d^§ : . 

J a 



(61) 



Like Q, these may be displayed as regularizations of the usual, more 
complicated expressions in real spacetime. Note first that _P M can be 
re-written as 



D ^ = iA~ 1 [dx T [ d~y T :0*<9 M 

JS JClx 



5* 

1 



1 dx T dy T :(f>* - d^* ■ (j> 
Js Jn x 



= — 2 A- x 1 / dx 



d 



s Jq x dy» 



The angular momentum can be recast similarly as 



(62) 



= i 



A x x I dx T dy T : (f)* {x^d u - x v d v ) (f) : 

JS JVL X 

-A' 1 dx T dy T [x^{d v - B v ) - x v {d^ - d^)] : <f>*<f> 

- JS J€l x 

dy T 



dy v v dyv 



(63) 

Using fl\ = —dB\ and applying Stokes' theorem, we therefore have 

8 2 

(64) 

dx T T$(x), 

! s 



= J Vr(J'(.), 



where 



d 2 



(65) 
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is a regularized energy-momentum density tensor which, incidentally, 
is automatically symmetric. Similarly, 




— x^T vl ){x) XvT^ (x) 
is a regularized angular momentum density tensor. 



5.5. Free Dirac Fields 

For simplicity, we specialize in this section (only) to the physical case 
of three spatial dimensions, s = 3. The proper Lorentz group Co is 
then SO(3, 1) + , where the plus sign indicates that A(j > 0, so that A 
preserves the orientations of space and time separately. Its universal 
covering group can be identified with ST (2, (E) as follows (Streater and 
Wightman [1964]): An event x G IR 4 is identified with the Hermitian 
2x2 matrix 

X +X X IX 

X L + IX 2 x u - X 6 J y ' 

where do = / (2 x 2 identity) and o~k (k = 1,2,3) are the Pauli spin 
matrices. Note that detX = x 2 = x ■ x. The action of SL(2, (E) on 
Hermitian 2x2 matrices given by 

X' = AXA\ AeSL(2,V), (2) 
induces a linear transformation on IR 4 which we denote by n(A): 



X = 7t(A) X 



(3) 
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From 

x' 2 = detX' = \detA\ 2 det AT = det AT = x 2 (4) 

it follows that rr(A) is a Lorentz transformation, and it can easily be 
seen to be proper. Hence it defines a map 

tt: SX(2,(D) -> £ , (5) 

which is readily seen to be a group homomorphism. Clearly, n(— A) = 
tt(A), and it can be shown that if n{A) = n(B), then A = ±B. Since 
5X(2, d) is simply connected, it follows that SX(2, (E) is the universal 
covering group of Cq, the correspondence being two-to-one. 
The relativistic transformation law as stated in section 5.3, 

(j>(Ax + a) = U(a,A)(j>(x)U(a,A)*, (6) 

applies to scalar fields, i.e. fields without any intrinsic orientation or 
spin. To generalize it to fields with spin, note first of all that since 
the representing operator U (a, A) occurs quadratically, the law is in- 
variant under U — > — U. This means that U could, in fact, be a rep- 
resentation, not of P 7 but of the inhomogeneous version of SL{2, (D), 

V 2 =IR 4 (DSL(2,(D), (7) 

which acts on IR by 

(a, A) x = re (A) x + a. (8) 

V2 is the two-fold universal covering group of Vq. A field ip(x) of 
arbitrary spin is a distribution taking its values in the tensor product 
L(T~i) <E> V of the operator algebra of the quantum Hilbert space 7i 
with some finite-dimensional representation space V of SL(2, (E). The 
transformation law is 

U(a, A) tfj(x) U(a, A)* = SiA' 1 ) if;(n(A)x + a), (9) 

where S is a given representation of SL(2, (E) in V. S determines the 
spin of the field, which can take the values j = 0, 1/2, 1, 3/2, 2, — 

The locality condition for the scalar field (axiom 4) can be ex- 
tended to non-scalar fields as 



[Mx),Mx')]=Q iHx-x') 2 <0 (10) 
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where ip a are the components of ift. Now it follows from the axioms 
that if the field has half-integral spin (j = 1/2,3/2,...), then the 
above locality condition implies that it is trivial, i.e. that ifj(x) = 0. 
A non-trivial field of half-integral spin can be obtained, however, 
if we modify the locality condition by replacing commutators with 
anticommutators: 



Replacing the commutators with anticommutators means that chang- 
ing the order in which ^(x) and tfj(x') are applied to a state vector 
in Hilbert space merely changes the sign of the vector, which has no 
observable effect. Hence the physical interpretation that events at 
spacelike separations cannot influence one another is still valid. 

Similarly, for fields of integral spin (j = 0,1,...), the locality 
condition with anticommutators gives a trivial theory, whereas a non- 
trivial theory can exist using commutators. 

The choice of commutators or anticommutators in the locality 
condition does, however, have an important physical consequence. For 
we have seen that the free asymptotic fields can be written as sums 
of creation and destruction operators for particles and antiparticles. 
If distinct points in the hyperplane x° = and 

ip+(x) denotes the positive-frequency part of the field (which can be 
obtained from if;(x — iy) by taking y — > in VT), then 



is a state with n particles located at these points. Since any two of 
these points are separated by a spacelike distance, the locality condi- 
tion implies that this state is symmetric with respect to the exchange 
of any two particles if commutators are used and antisymmetric with 
respect to the exchange if anticommutators are used. Particles whose 
states are symmetric under exchange are called Bosons, and ones an- 
tisymmetric under exchange are called Fermions. The choice of sym- 
metry or antisymmetry crucially affects the large-scale statistical be- 
havior of the particles. For example, no two Fermions can occupy the 
same state due to the antisymmetry under exchange; this is the Pauli 
exclusion principle. Hence the choice of commutators or anticommu- 
tators is known as the choice of statistics, and the above theorem 
correlating this choice with the spin is known as the Spin-Statistics 
theorem of quantum field theory (Streater and Wightman [1964]). 



{^ a {x)^p{x')} = ip a (x)ip p (x') +ipp(x')ip a (x) 
= if (x - x'f < 0. 



(11) 



V' + (^i)*V+(a;2)*---V' + (^ n )**o 



(12) 
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This theorem is fully supported by experiment, and represents one of 
the successes of the theory. 

The Dirac field is a quantized field with spin 1/2 whose asso- 
ciated particles and antiparticles are typically taken to be electrons 
and positrons, though it is also used (albeit less accurately) to model 
neutrons and protons. Our treatment follows the notation used in 
Itzykson and Zuber [1980], with minor modifications. The free Dirac 
field is a solution of the Dirac equation 

(i# - m)iP(x) = 0, (13) 

where 

is the Dirac operator and the 7's are a set of 4 x 4 Dirac matri- 
ces, meaning they satisfy the Clifford condition with respect to the 
Minkowski metric: 

{7^,7^=7^ + 7^ = 2^. (15) 

The components of ip satisfy the Klein-Gordon equation, since 
^ 2 = □ , and the solutions can be written as 



iP(x) = / dp [e- ixp u a (p)b a (p) + e ixp v a (p)d* a (p)} , (16) 

where u a and v a are positive- and negative-frequency four-compo- 
nent spinors and summation over the polarization index a = 1,2 is 
implied. b a and d a are operators satisfying the "canonical anticom- 
mutation relations" 



{b a (p), bp(q)} = {d a (p),d{3(q)} = {b a (p), dp(q)} = 0, 
{&«(?),&£(?)} = {da(p), = M27r) 3 5(p-q) 

for all p,q G 0+ . The spinors satisfy the orthogonality and complete- 
ness relations 
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u a {p) vP{p) = -v a {p) v (p) = 5 ap 
u«{p)®u«{p) = t±^ (lg) 

v a (p)®v a (p) = ^^, 
2m 

where a summation on a is implied in the last two equations and the 
adjoint spinors are defined by 

u a (p) = w Q (p)*7°, v a (p) = v a (p)*<y°. (19) 

In addition, 

u a (p) 1„ u P ip) = v a (p) 7 M v p {p) = P»6 af3 . (20) 

b a (p) and d a (p) are interpreted as annihilation operators for parti- 
cles and antiparticles, respectively, while their adjoints are creation 
operators. The adjoint field is defined by 



V>(x)=V(^)*7 (21) 

and satisfies 

The particle- and antiparticle number operators are now 

N+= f dpb* a {p)b a {p) 
/o+ 

Jn (23) 
N_= / dpd* a (p)d a (p), 

and the charge operator is 

Q = £ (iV + -iV_). (24) 

As for the Klein-Gordon field, we wish to give a phase-space rep- 
resentation of Q. The first step is to extend ip(x) to (E 4 using the 
Analytic-Signal transform, which gives 
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1>{z)= [ dp [6-^u a (p)b a (p)+6^v a (p)d* a (p)}. (25) 

Again, the extended field is analytic in T, with the parts in T + and 71 
containing only positive and negative frequncies, respectively. Using 
the above orthogonality relations, as well as 

/ da e i2p - izq = I da e izp - izq = 2w(p) (2tt) 3 <5(p - q) (26) 

J cr_|_ </<7_ 



for p,q e 0+, we obtain the following expressions for the particle- 
and antiparticle number operators as phase-space integrals: 



N + = da ip(z) ip(z) = da : ip(z) ip(z) 

(27) 

N_= da : ip(z)ip(z) :, 



where the fields in the first integral are already in normal order and 
the second integral involves two changes of sign: one due to the normal 
ordering, and another due to the orthogonality relation for the v a 's. 
The charge operator can therefore be given the following compact 
expression as a phase-space integral over the oriented phase space 
a = a+ — a-: 

Q = e da : ijifj := e dap(z), (28) 

J a J a 

where p = : tfjtfj : is the scalar phase-space charge density. The usual 
expression for the charge as an integral over a configuration space S 
is 

Qusual = e [ dlc^: J{x)Y ^{x) : = [ dx^{x). (29) 
Js Js 



To compare these two expressions, we again use Q\ = —dB\ and 
invoke Stokes' theorem: 
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J " A (30) 



A 

Js Jb x dy„ 
Define the phase-space current density 



f{z) = 2me:^{z) 1 ^{z):, (31) 

where the factor 2m is included to give the correct physical di- 
mensions, given our normalization. Note that j^(z) is conserved in 
spacetime, i.e. 

d $ u , , 7 u 94> (32) 
= 2me:— ^ + ^ = 

= 

by the Dirac equation combined with the analyticity of ^ in T. The 
same combination also implies 

l -r{z)=e:^l u d^: 



= ied v : ip^^ip : 

= ied v : ${g^ - : 

= ied^ : ipip : +ed u : ^ v i\) :, 



(33) 



where 



^ v = \\l^l v \ (34) 

are the spin matrices. The real part of this equation gives a phase- 
space version of the Gordon identity 

f{z) = ie{d» - d») : ^ : +e(d v + d v ) : ipa^ip : 

dp d T uv . (35) 
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The two terms are conserved separately, since 

9 P = ie{U- z - D 2 ) :^: = 

^ (36) 

and the second term, which is due to spin, does not contribute to the 
total charge since it is a pure divergence with respect to x. Thus 

Q = A~ 1 [dx^ [ dyf{z)= j d^J^Cx), (37) 

J S J B\ J S 

where 

J{ x) (x)=A? j dyf{x-iy) (38) 

is a "regularized" spacetime current. 

Note: The Dirac equation can also be written in the manifestly 
gauge-invariant form 

%d li f{z) = 2me : V>7 M d M ^ : = 2m 2 p(z). # (39) 

Again, ip is a "root vector" of the charge operator, since it removes 
a charge e from any state to which it is applied: 

[iP(z'),Q]=£iP(z') V/e(D 4 . (40) 

Substituting for Q the above phase-space integral and using the com- 
mutator identity 

[A, BC] = {A, B}C - B{A, C} (41) 
and the canonical anticommutation relations, we obtain 

^{z')= j da{iP(z%W)}^)= [ daK D (z',z)i;(z), (42) 

where the "reproducing kernel" for the Dirac field is a matrix-valued 
distribution on (E 4 x (E 4 given by 
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K D (z' 1 z) = {iP(z%iP(z)} 

= [ dp \0{y'p) 6{yp) e^ 2 '" 2 "^ u a ® u a 



+ 0(-y'p)6(-yp) e^ z ~~ z > p v 



i{z'-z)p Q ,a ^ 



(43) 



it/)' + m 
2m 



K(z',z). 



Here, K is the reproducing kernel for the Klein-Gordon field and <j)' 
is the Dirac operator with respect to the real part x' of z' . Like K, 
Ko is piecewise holomorphic in z' — z for z',z G T. Another form 
of the reproducing relation can be obtained by substituting the more 
complicated expression for Q given by eq. (37) into eq. (40): 

i;(z') = 2mA- 1 iTxJ dyK D (z',z)^i;(z). (44) 
Js Jb x 

This form is closer to the usual relation. 

The energy-momentum and angular momentum operators for the 
Dirac field can likewise be represented by phase-space integrals as 



Pfi= I da : ip id^ip : 

J a 

= da : V> {ix^d v - ix v d^ + \a^ v ) ip 

J a 



(45) 



More generally, let ip(z) represent either a Klein-Gordon field (in 
which case ip will mean ip*) or a Dirac field, and let T a be the local 
generators of an arbitrary internal or external symmetry group, so 
that the infinitesimal change in ip(z) is given by 

5^{z) = -ie a T a ^(z). (46) 

For example, T a is multiplication by e for {7(1) gauge symmetry, = 
id^ for spacetime translations (where the derivative is with respect 
to x M ), etc. (In case the theory has an internal symmetry higher 
than i7(l), of course, ip must have extra indices since it must be 
valued in a representation space of the corresponding Lie algebra.) 
The generators satisfy the Lie relations 
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[T a ,T b ] = C c ab T c , (47) 

where C c ah are the structure constants. Then we claim that the con- 
served global field observable corresponding to T a is 

Q a = [ da :^T a ^:. (48) 

For this implies 

mz'),Q a } = ! daK D {z',z)T a ^{z), (49) 

J a 

where Kd is replaced by K if tjj is a Klein-Gordon field. Since T a 
generates a symmetry, it follows that T a ip(z) is also a solution of the 
appropriate wave equation, hence it is reproduced by Kd- 

I daK D (z',z)T a ^(z) = T a ^{z'). (50) 
Therefore Q a has the required property 

[iP(z%Q a ] = T a iP(z'). (51) 
It can furthermore be checked that 



[Qa,Qb]= I da :ij{T a ,T b }ij:=C c ab Q c , 
J a 



(52) 



hence the mapping T a \— > Q a is a Lie algebra homomorphism. 

Finally, we show that due to the separation of positive and nega- 
tive frequencies in T, the interference effect known as Zitterbewegung 
does not occur for Fermions in the phase-space formalism. Let St be 
the configuration space defined by x° = t. Then the components of 
the "regularized" three-current at time t are 

jUt)=2mA~ 1 [ rf 3 x / dy:^:, (53) 
Js t Jb x 

and a straightforward computation gives 

J(X)(t) = I dp (£) [bl{ P ) b a {p) - d* a ( P ) d a {p)] . (54) 
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The right-hand side is independent of t, hence no Zitterbewegung 
occurs. In real spacetime, Zitterbewegung is the result of the in- 
evitable interference between the positive- and negative-frequency 
components of ip. Its absence in complex spacetime is due to the po- 
larization of the positive and negative frequencies of into T + and 
71, respectively. 

In the usual theory, Zitterbewegung is shown to occur in the 
single-particle theory; the above computation can be repeated for the 
classical (i.e., "first-quantized") Dirac field, with an identical result 
except for a change in sign in the second term due to the commuta- 
tion of d* a and d a . Alternatively, the above argument also implies the 
absence of Zitterbewegung for the one-particle and one-antiparticle 
states of the Dirac field. 



5.6. Interpolating Particle Coherent States 

We now return to the interpolating charged scalar field 4>. The asymp- 
totic fields satisfy the Klein-Gordon equation, 

(□ + m 2 ) cj) in {x) = 0, (□ +m 2 )cj) out {x) = (1) 

and have the same vacuum expectation values as the free Klein- 
Gordon field discussed in section 5.4. Hence, by Wightman's recon- 
struction theorem (Streater and Wightman [1964]), these three fields 
are unitarily related. We identify the free field of section 5.4 with 4>- in . 
Then there is a unitary operator S such that 

<f> out (x) = Sct> in (x)S*. (2) 

S is known as the scattering operator. 
Define the source field j(x) by 

j(x) = (□ +m 2 ) (f)(x). (3) 
It is a measure of the extent of the interaction at x, and by axiom 5, 

j(x) — > (weakly) as x° — > ±oo. (4) 

Note that we are not making any additional assumptions about j. If j 
is a known function (i.e., if it is a multiple of the identity on 7f for each 
x), then it acts as an external source for <fi. If, on the other hand, j 
is a local function of 4> such as : </> 3 :, it represents a self-interaction of 
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(p. In any case, the above equations can be "solved" using the Green 
functions of the Klein-Gordon operator, which satisfy 

(D x + m 2 )G(x) =S(x). (5) 
In general, we have formally 

*(»> = *>(») + /<*G(»-*W), W 

where 0o is a f ree held determined by the initial or boundary condi- 
tions at infinity used to determine G. The retarded Green function 
(we are back to s spatial dimensions) is defined as 



where 



P+ = (Po + ie, P) (8) 



with e > and the limit e j is taken after the integral is evaluated. 
G r et propagates both positive and negative frequencies forward in 
time, which means that it is causal, i.e. vanishes when xq < 0. Since 
it is also Lorentz-invariant, it follows that 

G ie t(x - x') = unless x - x' e V+. (9) 

G re t(x — x') is interpreted as the causal effect at x due to a unit 
disturbance at x' . The corresponding choice of free field 0o is 4>im 
hence 

4>{x) = (f>i n (x) + / dx' G ret (x - x')j(x'). (10) 



If j is a known external source, this gives a complete solution for (j)(x). 
If j is a known function of (p, it merely gives an integral equation which 
4> must satisfy. 

Similarly, the advanced Green function is defined by 

G adv (x - x') = (2n)- s ~ 1 j dp X J , (11) 
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with p- = (po — it, p) and e J, 0, and propagates both positive and 
negative frequencies backward in time, which means it is anticausal. 
The corresponding free field is 4> ou t, hence 

<f>(x) = 0out(aO + J dx'G adv (x - x') j(x'). (12) 

Let us now apply the Analytic-Signal transform to both of these 
equations: 

<f>(z) = <hn(z) + J dx' G Tet (z - x')j(x') 
cf)(z) = (pout(z) + J dx' G adv (z - x') j{x'), 
where (with z = x — iy) 

G ret (z - x') = ^— [ G Tet (x -ry- x') 



(13) 



=(2.r-w dp 



(14) 



and 



G adv (z -x') = — / ; G adv (x -ry - x') 

2m J_ 00 t - i 

= {2n)- s ~ 1 [ dp 



9{yp) e -i(z-x')p 
(p 2 _ — m 2 ) 



(15) 



Since the Analytic-Signal transform involves an integration over the 
entire line x(r) = x — ry, the effect of G ret (z — x') is no longer 
causal when regarded as a function of z and x' . Rather, it might 
be interpreted as the causal effect of a unit disturbance at x' on the 
line parametrized by z. (Note that only those values of r for which 
x — ry — z' G V+ contribute to the integral.) A similar statement goes 
for G adv (z -x'). 

Whereas <f>i n (z) and (f) ut( z ) are holomorphic in T, <f>(z) is not (un- 
less j(x) = 0), since G re t(z — x') and G adv (z — x') are not holomorphic. 
This breakdown of holomorphy in the presence of interactions is by 
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now expected. Of course 0, G re t and G a dv are all holomorphic along 
the vector field y, as are all Analytic-Signal transforms. 

In Wightman field theory, the vacua ^q 1 , ^Q Ut and \l/o of the 
in-, out- and interpolating fields all coincide (the theory is "already- 
renormalized" ) . Let us define the asymptotic particle coherent states 
by 

e t,z = 0in(*)**O 
e in,z = 0in(^)*O 
etut,z = </>out(^)*^0 

eout,z = (pout(z)Vo. 

We will refer to 



e+ 



e+ 2 + j dx' G ret (z-x>)j(x')*Vo 
etut,z + J dx' G adv (z-x')j(x'yq 



and 



e in,2 



e out,z 



+ J dx' G ret (z-x')j(x')q 
+ J dx' G adv (z-x')j(x')q - 



From the definitions it follows that 



G vet (z - x') = G adv (x' - z) 



G adv (z - X 1 ) = G ret (x' - Z), 

hence eq. (18) can be rewritten as 



4 = < 2 + f dx'G &dv (x' - z)j(x')* * 
= e+ ut)Z + y dx'G ret {x' -z)j(x')** . 



(16) 



e+ = 0(^**0, ej = 0(2:)*o (17) 
as the interpolating particle coherent states. By eq. (13), 



(18) 



(19) 



(20) 



(21) 
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Eqs. (19) and (21) display the interpolating character of ef. Note 
that when j(x) is an external source, then the interpolating particle 
coherent states differ from the asymptotic ones by a multiple of the 
vacuum. 

As in the case of the free theory, a general state with a single 
positive charge e can be written in the form 

*+ = 0*(/)* o . (22) 

For interacting fields, this may, in general, no longer be interpreted as 
a one-particle state, since no particle-number operator exists.* But 
the charge operator does exist since charge (unlike particle-number) is 
conserved in general, due to gauge invariance; hence makes sense 
as an eigenvector of charge with eigenvalue e. \I/jr can be expressed 
in terms of particle coherent states as 



f{z) ee ( e+ | ) 

= ( < 2 I */ ) + J dx'G ret (z - x') ( * \j(x') 
= ( e+ t)2 | 9+ ) + J dx' G adv (z - x') ( *o | j(x') tf+ ) 



+ ) 

(23) 



f(z) satisfies the inhomogeneous equations 



(p x + m 2 ) f(z) = J dx' (U x + m 2 ) G Yet (z - x') ( tf | j(x') ) 

= J dx' (a x + m 2 )G adv (z-x')(y \j(x')y+). 

(24) 

But from the definitions it follows that 

* If the spectrum E contains an isolated mass shell and f(p) 
is concentrated around , then is, in fact, a one-particle state. 
This is the starting point of the Haag-Ruelle scattering theory (Jost 
[1965]). I thank R. F. Streater for this remark. 
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(p x + m 2 ) G Tet (z - x') = (D x + m 2 ) G aAv (z - x') 

= (2tt)- s - 1 / dp 9(yp) e- l{z - x ' )p (25) 
= S(z — x'), 

where the last equation is a definition of 5(z — x') as the Analytic- 
Signal transform with respect to x of 5(x — x'). The above is easily 
seen to reduce to 

(n x + m 2 )f(z) = (y \j(z)y+), (26) 

where j(z) is the Analytic-Signal transform of j(x). Equivalently, 
eq. (3) can be extended to (E s+1 by applying the Analytic-Signal 
transform, giving 

(D x + m 2 )(P(z) =j(z), (27) 

hence 

(n x + m 2 ) f(z) = ( * I (□« + m 2 ) cj>{z) \ ) = ( * I j(z) ^ )• (28) 

For a known external source, this is a "perturbed" Klein-Gordon 
equation for f(z); if j depends on 0, it appears to be of little value. 



5.7. Field Coherent States and Functional Integrals 

So far, all our coherent states have been states with a single particle or 
antiparticle. In this section, we construct coherent states in which the 
entire field participates, involving an indefinite number of particles. 
We do so first for a neutral free Klein-Gordon field (or a generalized 
free field; see section 5.3), then for a free charged scalar field. A sim- 
ilar construction works for Dirac fields, but the "functions" labeling 
the coherent states must then anticommute instead of being "classi- 
cal" functions and a generalized type of functional integral must be 
used (Berezin [1966], Segal [1956b, 1965]). We also indulge in some 
speculation on generalizing the construction to interpolating fields. 
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An extended neutral free Klein-Gordon field satisfies the canoni- 
cal commutation relations 

[<j>(z),4>(z')}=0 
[ ( f>(z),cf>(zr} = K(z,z') = -iA+(z-z') 

for all z,z' G T + , as well as the reality condition <p(z)* = (j)(z). The 
basic idea is that since all the operators cj){z) (z G T + ) commute, it 
may be possible to find a total set of simultaneous eigenvectors for 
them. This is not guaranteed, since <p(z) is not self-adjoint (it is not 
even normal, by eq. (1)) and, in any case, it is unbounded and thus 
may present us with domain problems. However, this hope is realized 
by explicitly constructing such eigenvectors. This construction mimics 
that of the canonical coherent states in section 3.4, which used the 
lowering and raising operators A and A*. As in the case of finitely 
many degrees of freedom, the canonical commutation relations mean 
that <fi* acts as a generator of translations in the space in which <p 
is "diagonal." The construction proceeds as follows: Let f(p) be a 
function on IR S , which will also be regarded as a function on To 

simplify the analysis, we assume to begin with that / is a (complex- 
valued) Schwartz test function, although this will be relaxed later. 

/ determines a holomorphic positive-energy solution of the Klein- 
Gordon equation, 

/(*)= / dp e-*** f(p). (2) 

Define 

0*(/)=/ dpa*(p)f(p)= [ dacf>(zyf(z), (3) 

where a+ is any particle phase space and the second equality follows 
from theorem 4.10 and its corollary. (Note: this is not the same as 
the smeared field in real spacetime, since the latter would involve 
an integration over time, which diverges when / is itself a solution 
rather than a test function in spacetime.) The canonical commutation 
relations imply that for z G 7+, 

[*(*),**(/)]=/ da(z')K(z,z')f(z') = f(z), (4) 
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and for n > 1, 

tt(z), ( t>*(f) n ]=f(z)-n ( t>*(f) n - 1 . (5) 
We now define the Geld coherent states of (j) by the formal expression 

= ¥ . (6) 

Then if z G 7+, so that \l/o = 0, eq. (5) implies that 

<j>(z)Ef = [<p(z),e<t>*W]y 

= f(z)Ef. U 

Hence is a common eigenvector of all the operators (j>(z), z E T + . 
This eigenvalue equation implies that the state corresponding to E? 
is left unchanged by the removal of a single particle, which requires 
that E? be a superposition of states with 0, 1, 2, • • • particles. Indeed, 

oo 1 

E f = ^T^*(/r*o. (8) 

n=0 

The projection of E? to the one-particle subspace can be obtained by 
using the particle coherent states e z : 

(e z \Ef} = (y \<P(z)Ef) 

= f(z)(V \Ef) = f(z\ 

where the last equality follows from 4>(j) \I/ o = (</>*(/))* = 0. More 
generally, the n-particle component of is given by projecting to 
the n-particle coherent state 

e ZlZ2 ... Zn = 0(z!)>(z 2 )* • • • <t>(z n )* * , (10) 

which gives 

(e ZlZ2 ... Zn \Ef) = f(z 1 )f(z 2 )---f(z n ), (11) 

so all particles are in the same state / and the entire system of par- 
ticles is coherent! Similar states have been found to be very useful 
in the analysis of the phenomenon of coherence in quantum optics 
(Glauber [1963], Klauder and Sudarshan [1968]), where the name "co- 
herent states" in fact originated. In the usual treatment, the positive- 
frequency components have to be separated out "by hand" using their 



5. 7. Field Coherent States and Functional Integrals 



211 



Fourier representation, since one is dealing with the fields in reai 
spacetime. For us, this separation occured automatically though the 
use of the Analytic-Signal transform, i.e. <j>*(f) can be defined di- 
rectly as an integral of f(z) over <j + . (This would remain true even if 
/ had a negative-frequency component, since the integration over cr+ 
would still restrict / to positive frequencies.) 

The inner product of two field coherent states can be computed 
as follows. Note first that if g(z) is another positive-energy solution, 
then 

<Kf)E'= [ daW)<P(z)E 9 

J <T_|_ 

(l2) 

{f\g)E\ 



= I daf(z)g(z)E 9 

J <T_|_ 



where, by theorem 4.10, 

(f\g)=f daj{z)g{z)= f dpj(p)g(p). (13) 

J CT4- JVL + 



Hence 



E f E 9 ) = #o e^ (/) E 9 ) 



Thus E$ belongs to Tt (i.e., is normalizable) if and only if /(p) belongs 
to L+(<ip) or, equivalently, f(z) belongs to the one-particle space /C 
of holomorphic positive-energy solutions. If we suppose this to be 
the case for the time being, then the field coherent states are 
parametrized by the vectors / G L 2 + {dp) or / e /C. Next, we look for a 
resolution of unity in 7i in terms of the E^s. The standard procedure 
(section 1.3) would be to look for an appropriate measure dfi(f) on 
IC. Actually, it turns out that due to the infinite dimensionality of /C, 
a larger space JC' D fC will be needed to support dfx. Thus, for the 
time being, we leave the domain of integration unspecified and write 
formally 

/ dpU) \E f )(Ef\ =I n , (15) 
Jk 
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where d\i is to be found. Taking the matrix element of this equation 
between the states E h and E 9 , we obtain 

f d/i(/)e< h l'> + <'l«>=e< h l»>. (16) 

JK. 

With h = —g this gives 

/ dfi(f) e < / 1 s, >-< 9 1 / > = e -< 3ls) =%]. (17) 

The left-hand side is an infinite-dimensional version of the Fourier 
transform of do,, as becomes apparent if we decompose / and g into 
their real and imaginary parts. The Fourier transform of a measure is 
called its characteristic function. Hence we conclude that a necessary 
condition for the existence of d\x is that its characteristic function be 
S[g]. In turn, a function must satisfy certain conditions in order to be 
the characteistic function of a measure. In the finite-dimensional case, 
Bochner's theorem (Yosida [1971]) guarantees the existence of the 
measure if these conditions are satisfied. If the infinite-dimensional 
space of /'s is replaced by (E n , the above relation would uniquely 
determine d\i as a Gaussian measure. For the identity 

/ detA- 1 d 2n (exp[-n((-A0*A- 1 ((-A0] = l, (18) 

where A is a positive-definite matrix, implies 

/ dMC)e^ +rc) =e^, (19) 

with 

df,(C) = detA- 1 expt-Ti-CM-^] d 2n (. (20) 

The integral in eq. (19) is entire in the variables £ and separately, 
hence it can be analytically continued to — > — £*, giving 

/ ^(C)e^ K * C) =e-^^. (21) 
If £ = a + i(3 and £ = u + iv with a, (3,u,v G IR n , then 
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/ dn(ct, (3) e 2«(««-/3«) = e -AuAu+vAv) _ ^2) 

showing that S(v, —u) is the Fourier transform of dfi(ot,(3). 

If A is merely positive-semidefinite, i.e., if it is singular, then d\i 
still exists but is concentrated on the range of A, and A -1 makes 
sense as a map from this range to the orthogonal complement of the 
kernel of A. 

Eq. (19) is a finite-dimensional version of eq. (16) (with g = h). 
In going to the infinite-dimensional case, two separate complications 
arise. First of all, recall that in finite dimesions, the Fourier transform 
takes funcions on IR n to functions on the dual space, (IR n )* (section 
1.1). One usually identifies these two spaces by choosing an inner 
product on IR n , e.g., the Euclidean inner product. In the infinite- 
dimensional case, it is tempting to extend Bochner's theorem by let- 
ting lR n go to a Hilbert space and looking for a measure on this space. 
However, Segal [1956a, 1958] has shown that the ensuing dfi cannot 
be a Borel measure, since it is only finitely additive. To obtain a Borel 
measure, the domain of integration must be expanded to a space of 
distributions, its dual then being a space of test functions. Minlos' 
theorem (Gel'fand and Vilenkin [1964]; Glimm and Jaffe [1981]) states 
that if a functional S[g] defined on the Schwartz space of test functions 
5(IR S ) satisfies appropriate conditions (positive-definiteness, normal- 
ization and continuity), a Borel probability measure dfj, exists on the 
dual space of tempered distributions 5'(IR S ) such that eq. (17) is 
satisfied for all g G 5(IR S ), the integration being over S / (IR S ). This 
resolves the problem of infinite dimansionality. In our case, however, 
the situation is further complicated by the fact that the space of /'s 
over which we wish to integrate, even if it is enlarged, consists not 
of free functions but of solutions of the Klein-Gordon equation. This 
difficulty can be overcome by first applying Minlos' theorem in the 

momentum space representation, where S[g] = exp ( — ll^llia^p) ) sa t- 

isfies the necessary conditions for g G <S(IR S ). This gives a probability 

measure dfl(f) on iS>'(IR s ). We then define the spaces 



/Co = {g(z) = dp e-**»g(p) \ges} 
/C = {/(*) = dp e-*** /(p) | / G S' } 



(23) 
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These may be regarded as mutually dual, under the sesquilinear pair- 
ing 



( 



f,g) = (f,g) = ! + dp f(p)9(p), f eS',ge S. (24) 



Together with /C, they form a "triplet" 

/C C K C /C . (25) 

We now use the map / i— > / to transfer the measure from S' to /C , 
obtaining a probability measure d\i on /C . This results, finally, in the 
resolution of unity 

/ dn(f) \E')(E*\ =/, (26) 
Jk' 

where the integral converges, as usual, in the weak operator topology. 
d\x is Gaussian in the sense that its restrictions to finite-dimensional 
cylinder sets in /C are all Gaussian measures. It is, therefore, an 
infinite-dimensional version of the Gaussian measure on (E s which 
gave the resolution of unity for the canonical coherent states in section 
1.2. 

One might well ask what is the point of insisting that the inte- 
gration take place over /C rather than iS'(IR s ), the momentum space 
representation. One reason is esthetic: The vectors combine the 
finite-dimensional (particle) coherent-state representation with the 
infinite-dimensional (field) coherent-state representation. Another 

reason is that whereas the "sample points" / in 5'(IR S ) are merely 
distributions, the elements / in )C' are holomorphic functions, since 

the decaying exponential e~ yp dominates the singular behavior of /, 
just as it did when / e L 2 (dp). Note, however, that non-normalizable 
field coherent states E? now enter the resolution of unity. In fact, the 
Hilbert space /C has measure zero with respect to dfi, since L 2 (dp) has 
measure zero with respect to dfl. This is remedied by the fact that 
only vectors h, g in the test function space /Co are now allowed in eq. 

(i6). i ' ; ^ 

With the resolution of unity provided by the field coherent states, 
the inner product in TL can be represented by the functional integral 



5. 7. Field Coherent States and Functional Integrals 



215 



= / dii{f)($\Ef)(Ef\V) 

= [ duu) wmn- 



(27) 



The above construction was for a neutral scalar field. If 4> were 
charged, its coherent states would take the form 

E f,S = e r(f)+K§)y 0i (28) 

where <j>*(f) is as before and <j)(g) is an antiparticle creation operator, 

<I>{S)= [ dacf>(z)g(z), (29) 

which commutes with 0*(/). g(z) is a negative-energy solution of the 
Klein-Gordon equation, holomorphic in 71, or, equivalently, g(z) = 
g(z) with g E IC' . Thus we write g G JC' . The resolution of unity for 
charged fields is therefore 

/ _dn(f,g)\Ef>B)(EM\ =I n , (30) 



where dfj,(f,g) = d/i(f) dfi(g) is the tensor product of two Gaussian 
measures defined as above. 

Finally, it is reasonable to ask whether coherent states exist for an 
interpolating field, by analogy with the interpolating particle coherent 
states studied in sections 5.3 and 5.6. A necessary condition would 
seem to be that the first half of the canonical commutation relations 
still be valid, i.e., 



[<P(z),<P(z')} = (31) 

for z, z' G 7+, since one would like to find simultaneous eigenvectors 
of 4>{z) for all z G T + . Recall that the extended free neutral scalar 
field had the form 



<P(z)= [ dp [9- lzp a(p) + e %zp a*(p)] 



(32) 
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for arbitrary z G (E s , and the commutation relation given by eq. 
(31) was due to the polarization of the positive and negative frequen- 
cies into T + and 71, respectively. If interactions are introduced, the 
positive- and negative-frequency components get inextricably mixed 
together, hence it is highly unlikely that the above commutation re- 
lation survives. However, a charged free field has the form 

<f>(z)= [ dp [6- izp a(p)+6 lzp b*(p)], (33) 

where b* commutes with a, hence 

[0(z), cj)(z')] = Vz, z e (E s+1 . (34) 

I believe that this relation does have a chance of holding for interpo- 
lating charged scalar fields. It would be a consequence, for example, of 
the physical requirement that the Lie algebra generated by the field 
has no operators which remove (or add) a double charge 2e. This 
commutation relation is the weaker half of the free-field canonical 
commutation relations, the stronger half (which we do not assume) 
being that [<j>(z), <J}(z')*] is a "c-number," i.e. a multiple of the iden- 
tity. If [(f>(z), <j>(z')] = for all z and z', then it makes sense to look 
for common eigenvectors of all the <j>(z)% which would be coherent 
states of the interpolating field. 



Notes 

Most of the results in sections 5.2-5.5 were announced in Kaiser 
[1987b] and have been published in Kaiser [1987a]. An earlier at- 
tempt to describe quantized fields in complex spacetime was made 
in Kaiser [1980b] but was found to be unsatisfactory. The Analytic- 
Signal transform is further studied in Kaiser [1990c]. 

Segal [1963b] proposed a formulation of quantum field theory in 
terms of the symplectic geometry of the phase space of classical fields. 
This phase space corresponds, roughly, to the space JC' Q defined in sec- 
tion 5.7. An attempt to study quantized fields as (operator-valued) 
functions on the Poincare group-manifold has been made by Lurgat 
[1964]; see also Hai [1969]. As mentioned in section 4.2, this manifold 
may be regarded as an extended phase space which includes spin de- 
grees of freedom in addition to position- and velocity coordinates. In 
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the context of classical field theory, this point of view has been gener- 
alized to curved spacetime by replacing the Poincare group-manifold 
with the orthogonal frame bundle over a Lorentzian spacetime (Toller 
[1978]). These efforts have not, however, utilized holomorphy. It may 
be interesting to expand the point of view advocated here to a complex 
manifold containing the Poincare group-manifold in order to account 
naturally for spin. This might result in a "total" coherent-state rep- 
resentation where the classical phase space coordinates range over T 
and the spin phase space coordinates range over the Riemann sphere, 
as in section 3.5. 

I owe special thanks to R. F. Streater for many important com- 
ments and corrections in this chapter. 
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Chapter 6 
FURTHER DEVELOPMENTS 



6.1. Holomorphic Gauge theory 

In this section we give a brief, not-very-technical but hopefully intu- 
itive, discussion of gauge theory and indicate how it may be modified 
in order to make sense in complex spacetime. This represents work 
still in progress, and our account is accordingly incomplete. One ob- 
stacle is the absence, so far, of a satisfactory Lagrangian formulation. 
This is due in part to the fact that fields in complex spacetime are 
constrained since they can be derived from local fields in IR S+1 (chap- 
ter 5). Our treatment is as elementary as possible, with a geometrical 
emphasis. We ignore global questions and work within a single chart. 
For more details, see Kaiser [1980a, 1981]. 

Gauge theory is a natural, geometric way of introducing interac- 
tions. It applies some of the ideas of General Relativity to quantum 
mechanics and arrives at a class of theories which are generalizations 
of classical electrodynamics, the latter being the simplest case. The 
power of Einstein's theory of gravitation owes much to the fact that it 
actually assumes less, initially, than its predecessor, Newton's theory. 
By dropping the assumption that spacetime is flat, we lose the ability 
to transport tangent vectors from one point in spacetime to another, 
as is done when differentiating a vector field or even finding the ac- 
celeration of a particle moving in spacetime. To regain it, we need a 
connection. We need to know how a vector transforms in going from 
any given point Xq to a neighboring point along a curve x(t). The 
tangent vector to the curve at the point xq is 

X = x^. (1) 

(The partial-derivative operators = d/dx^ form a basis for the 
tangent space at x ; see Abraham and Marsden [1978].) An infinites- 
imal transport must have a linear effect, thus a vector Y at xq should 
change by 
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SY = T(X)Y8t, (2) 

where Y(X) is a linear transformation on the tangent space at xq. Fur- 
thermore, Y(X) must be linear in X since the latter is an infinitesimal 
(i.e., linearized) description of the curve at xq. If Y = Y v d v , this gives 

5Y = x»Y v T{d ll )d v 8t. (3) 
Since r(<9 M ) is a linear transformation, we have 

5Y = x»Y v T%d K 8t, (4) 

where T^,, set of (locally defined) funcions on spacetime, known 
as the connection coefficients. This gives the rate of change of Y due 
to transport along X. 

Now suppose we are given a metric g on spacetime. (In Relativ- 
ity, the "metric" is indefinite, i.e. Lorentzian rather than Rieman- 
nian; with this understood, we continue to call it a metric.) If two 
vectors are transported along a curve, then their inner product must 
not change since it is a scalar. This gives a relation betweeen Y and 
g which determines the symmetric part (with respect to exchange of 
X and Y) of Y. The antisymmetric part is the torsion, which in the 
standard theory is assumed to vanish. Hence the metric uniquely 
determines a torsionless connection known as the Riemannian con- 
nection. 

General Relativity (see Misner, Thorne and Wheeler [1970]) re- 
lates the Riemannian connection to Newton's gravitational potential, 
thus giving gravity a geometric interpretation. This is reasonable, 
since the connection determines the acceleration of a particle moving 
freely (i.e., along a geodesic) in spacetime, which is related to gravity. 
By assuming less to begin with, one discovers what must necessar- 
ily be added to even compute the acceleration, and this additional 
structure turns out to include gravity! In Newton's theory, gravity 
must be added in an ad-hoc fashion. The two theories coincide in the 
non-relativistic limit. 

Now consider the wave function of a (scalar) quantum particle, for 
example a solution of the free Klein-Gordon equation, in real space- 
time IR S+1 . Suppose we drop the usually implicit assumption that 
/ can be differentiated by simply taking the difference between its 
values at neighboring points. Instead of regarding f(x) as a com- 
plex number, we now regard it as a one-dimensional complex vector 
attached to x, analogous to the tangent vectors in Relativity. This 
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means assuming less structure to begin with, since f(x) now belongs 
to (E as a vector space rather than as an algebra. The complex plane 
attached to x will be denoted by (E x and is called the fiber at x, and 
the set of all fibers is called a complex line bundle over IR S+1 (Wells 
[1980]). To differentiate /, we must know how it is affected by trans- 
port. The situation is similar to the one above, only now T(X) must 
be a 1 x 1 complex matrix, i.e. a complex number. An infinitesimal 
transport gives 

5f = T(X)f5t = x»TJ5t, (5) 

where T^(x) is a complex-valued function. The total rate of change 
along X is therefore 

x^f + x^TJ = D x f, (6) 

where Dx is called the covariant derivative along X . Equivalently, 
the differential change is 

Df = df + Tf, where r = T M dx^. (7) 

The 1-form V is called the connection form. Df is the sum of a "hor- 
izontal" part df (which measures change due to the dependence of / 
on x) and a "vertical" part (which measures change due to transport). 
A gauge transformation is represented locally by a multiplication by 
a variable phase factor, i.e. 

f(x)^e^f(x). (8) 

This is a linear map on each fiber (E x , which corresponds in Rela- 
tivity to the linear map on tangent spaces induced by a coordinate 
transformation. In fact, since there is no longer any natural way to 
identify distinct fibers, a gauge transformation is a coordinate trans- 
formation of sorts. We therefore require that Df be invariant under 
gauge transformations, which implies that V transforms as 

r ^ r - idx- (9) 

Suppose now that we try to complete the analogy with Relativity 
by deriving V from a Hermitian metric on the fibers such that the 
scalar product of two vectors remains invariant under transport. If 
we assume the metric to be positive-definite, it must have the form 



(f(x),g(x)) = f(x)h(x)g(x), 



(10) 
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where h(x) is a positive function. It will suffice to consider the inner 
product of f(x) with itself, i.e. the quantity 

p(x) = (f(x), f(x)) = W)Hx) f{x). (11) 

We require that p be invariant under transport. This means that 
(/, /) changes only due to its dependence on x, i.e. 

(Df,f) + (f,Df)=d(f,f). (12) 

It follows that 

Th + hT = dh, (13) 

which constrains the real part of V but leaves the imaginary part 
arbitrary. Writing T = R + iA, where R and A are real 1-forms, we 
have 

2R = d\ogh. (14) 

The real part R of the connection can be transformed away by defining 
/ = h}' 2 f and h = 1, which gives p = p and 

Df = h- 1 / 2 (df + iAf) = h- 1 / 2 (d + f )/. (15) 

Since p = f f, we may as well assume from the outset that p = \ f\ 2 
and r = iA is purely imaginary. 

Note: The mapping / i— > / is not a gauge transformation in the 
usual sense; in the standard gauge theory the metric is assumed to be 
constant (h(x) = 1), hence only phase translations are allowed. This 
corresponds to having already transformed away R. It turns out that 
in phase space, it will be natural to admit non-constant metrics. 

To make the Klein-Gordon equation invariant under gauge trans- 
formations, we now replace <9 M by _D M = + iA^. The result is 



(D^D^ + m 2 )f = (0" + iA^ + iAJf + m 2 f = 0. (16) 

This equation was known (even before gauge theory) to be a relativis- 
tically covariant description of a Klein-Gordon particle in the pres- 
ence of the electromagnetic field determined by the vector potential 
A pix). Hence the connection, which describes a geometric property 
of the the complex line bundle, acquires a physical significance with 
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respect to electrodynamics, just as did the connection V with respect 
to gravitation. When / is differentiated in the usual way, it is un- 
consciously assumed that the connection vanishes. Coupling to an 
electromagnetic field then has to be put in "by hand," through the 
substitution — > <9 M + which is known as the minimal coupling 
prescription. Gauge theory gives this ad-hoc prescription a geometric 
interpretation. But note that in this case, the fiber metric h(x) did 
not determine the connection. This is due to the complex structure: 
iAf cancels in the inner product because it is imaginary. The elec- 
tromagnetic field generated by the potential A is given by the 2-form 

F = dA, (17) 

which in fact measures the non-triviality of the connection form A: 
if F = 0, then A is closed and therefore (locally) exact, i.e. it is 
due purely to a choice of gauge. This is analogous in Relativity to 
choosing an accelerating coordinate system, which gives the illusion 
of gravity. 

Note the complementary nature of the two theories: in Relativity, 
the skew part of the connection, which is the torsion, is assumed to 
vanish. In gauge theory, the inner product becomes Hermitian, and 
the symmetric and antisymmetric parts correspond to its real and 
imaginary parts, respectively. It is the reai part of the connection 
which is assumed to vanish in gauge theory. Were V required to 
be real, it could in fact be transformed away as above, giving rise 
to no gauge field. That is, only the trivial part of the connection 
is determined by the metric. The non-trivial (imaginary) part is 
arbitrary. We will see that when the theory is extended to complex 
spacetime, the metric does determine a non-trivial connection. 

Gauge theory usually begins with a Lagrangian invariant under 
phase translations / t— > e^f, which form the group U(l). The equa- 
tions satisfied by / and A^ are derived using variational principles 
(see Bleeker [1981]). There is a natural generalization where f(x) is 
an n-dimensional complex vector. In that case, the group of phase 
tanslations is replaced by a non-abelian group G, usually a subgroup 
of U(n). G is called the gauge group, and the correponding gauge 
theory is said to be non-abelian or of the Yang-Mills type. Such 
theories have in recent years been applied with great success to the 
two remaining known interactions (aside from gravity and electro- 
magnetism), namely the weak and the strong forces, which involve 
nuclear matter (Appelquist et al. [1987]). 

Let us now see how non-abelian gauge theory may be extended 
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to complex spacetime. (This will include the abelian case of electro- 
dynamics when n = 1.) Consider a field / on complex spacetime, 
say on the double tube T, whose values are n-dimensional complex 
vectors. The set of all possible values at z E T is a complex vector 
space F z m (E n called the fiber at z. The collection of all fibers is 
called a vector bundle. We assume that this bundle is holomorphic 
(Wells [1980]), so that holomorphic sections z \— > f(z) G F z , repre- 
sented locally by holomorphic vector-valued functions, make sense. 
Upon transport along a curve z(t) having the complex tangent vector 
Z, f changes by 

6f = 6(Z)f6t, (18) 

where 9{Z) is a linear map on each fiber. The total differential change 
is 

Df = (d+e)f. (19) 

If z = x — iy, then 

d = d + B, (20) 



where 



d = dz»d u =ldz» ( + i 



2 \dx» dy 

1/0 d 
d = dz»d„ = ^dz» [ -^--i- 



(21) 



" 2 d y". 

Since a general tangent vector has the form 

Z = Z li d li + Z fi d li , (22) 

we have 



9(z) = e^dz^ + e fi dz^ (23) 

where 6> M = 6>(<9 M ) and 6>^ = 6>(<9p. 

Let us now try again to derive the connection from a fiber metric, 
as we have failed to in the case of real spacetime. A positive-definite 
metric on the fibers F z must have the form 



(f(z),g(z)) = f(z)*h(z)g(z) 



(24) 
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where h(z) is a positive-definite matrix. Again, it will suffice to con- 
sider the (squared) fiber norm 

p(z) = (f(z), f(z)). (25) 

We define a holomorphic gauge transformation to be a map of the 
form 

f(z)^f'(z) = X (z)- 1 f(z), 
h(z)^ti = X (zyh(z)x(z), 

where x( z ) * s an invertible nxn matrix-valued holomorphic function. 
Clearly p is invariant under holomorphic gauge transformations. The 
corresponding gauge group acting on a single fiber F z is the general 
linear group G = GL(n,(f5), which includes the usual gauge group 
U(n). However, analyticity correlates the values of x( z ) & t different 
fibers. Invariance of the inner product under transport gives 

(Df,f) + (f,Df) = d(f,f), (27) 
from which we obtain the matrix equation 

9*h + h9 = dh = dh + dh. (28) 

As in the case of real spacetime, this only determines the Hermitian 
part (relative to the metric) of 9. But if we make the Ansatz 

h9 = dh, (29) 

then the resulting connection 9 = h~ 1 dh satisfies the above constraint. 

We now show that in general, the connection 9 is non-trivial, i.e. 
cannot be transformed away by a holomorphic gauge transformation. 
Under such a transformation, 9 becomes 

= x - 1 h- 1 dh X + X- 1 d X (30) 
= X -1 #X + X -1 dx, 

since d X * = by analyticity. It follows that 

iyf = {d+ff)f = x- x D{xn (si) 

hence 

(D') 2 f = X ~ 1 D 2 ( X f). (32) 
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If 9 were trivial, then for some gauge we would have 6' = 0, hence 
(D') 2 = d 2 = 0, so D 2 = 0. But 

D 2 f = (d + e) (d + e) f = d{e f) + e a df + e a of 

= (d0)/ + 0A0/ = e/, 
where the 2-form 

Q = d9 + 9A9 = d9 + d9 + 9A9 (34) 

is the curvature form of the connection 9, analogous to the Riemann 
curvature tensor in Relativity. Using the matrix equation 

dbT x = -h~ 1 dh-h- 1 (35) 
and d 2 = B 2 = dd + Bd = 0, we find 



89 + 9 A 9 = (-h^dh ■ h' 1 ) A dh + (h^dh) A (h^dh) = 0. (36) 

This is an integrability condition for 9, being a consequence of the fact 
that 9 can be derived from h. One could say that h is a "potential" for 
9. Therefore the quadratic term cancels in eq. (34) and the curvature 
form reduces to 

= 39. (37) 

Hence if h is such that B (h~ 1 dti) ^ 0, then the connection is non- 
trivial. The form G is the complex spacetime version of a Yang-Mills 
field, and 9 corresponds to the Yang-Mills potential. 

For n = 1, 9 and G are the complex spacetime versions of the 
electromagnetic potential and the electromagnetic field, respectively. 
Since h(z) is a positive function, it may be written as 

h(z) = e-^ z) (38) 

where <p(z) is real. Then 

9 = -d<f>, G = -Bde. (39) 

To relate 9 and G to the electromagnetic potential A and the electro- 
magnetic field F, one performs a non-holomorphic gauge transforma- 
tion similar to that in eq.(15): let 
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/» = e-^/ 2 f(z), h(z) = 1. (40) 
Then the transformed potential becomes purely imaginary, 

e = - l -^dx^ (4i) 

giving 

for the complex spacetime version of the electromagnetic potential. 
Note that although A^(z) is a pure gradient in the y-direction, the 
corresponding electromagnetic field is not trivial, since 



dA^ OA, 



dx v dx^ . . 

d 2 ^ d 2 <p "1 (Jd) 



dx^dy v dx v dy^ 



need not vanish, in general. 

Incidentally, there is an intriguing similarity between the inner 
product using the fiber metric, 

(f\g)= [ dfi(z)J(zje- Hz) g(z), (44) 



and that in Onofri's holomorphic coherent states representation (sec. 
3.4, eq. (62)). Note that our O coincides with Onofri's symplectic 
form — uj. This possible connection remains to be explored. 
Remarks. 

1. The relation between the Yang-mills potential A and the Yang- 
Mills field F in real spacetime is F = dA + A A A, which is 
quadratic in the non-abelian case (n > 1), since the wedge prod- 
uct AAA involves matrix multiplication. In our case, however, the 
connection satisfies the integr ability condition given by eq. (36), 
hence the quadratic term cancels and the relation becomes linear, 
just as it is normally in the abelian case. The non-holomorphic 
gauge transformation / i— > / in eq. (40) can be generalized 
to the non-abelian follows: Since h(z) is a postive ma- 

trix, it can be written as h(z) = k(z)*k(z), where k(z) is an 
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n x n matrix. (k(z) need not be Hermitian; the holomorphic case 
dk = corresponds to a "pure gauge" field, i.e. G = 0.) Setting 
f(z) = k(z)f(z) and h(z) = 1 brings us to the unitary gauge, 
where the new gauge transformations are given by unitary mat- 
ices f(z) i— > U(z)f(z). This amounts to a reduction of the gauge 
group from GL(n, (E) to U(n). In the unitary gauge, the relation 
between the connection and the curvature becomes non-linear, as 
it is in the usual Yang-Mills theory. See Kaiser [1981] for details. 

2. G has a symmetric real part and an antisymmetric imaginary 
part. The antisymmetric part corresponds to the usual Yang- 
Mills field, whereas the symmetric part does not seem to have an 
obvious counterpart in real spacetime. 

3. In complex differential geometry, 9 = h~ 1 dh is known as the 
canonical connection of type (1, 0) determined by h. (Wells 
[1980]). The functions / are assumed to be (local representations 
of) holomorphic sections of the vector bundle. We do not make 
this assumption, since it appears that analyticity may be lost in 
the presence of interactions. However, it is possible that f(z) is 
holomorphic, and it is the non-holomorphic gauge transformation 
/ l— ^ / which spoils the analyticity. That is, the non-analytic part 
of the theory may be all contained in the fiber metric h. 



6.2 Windowed X— Ray Transforms: Wavelets Revisited 

In this final section we generalize the idea behind the Analytic-Signal 
transform (section 5.2) and arrive at an n-dimensional version of the 
Wavelet transform (chapters 1 and 2). In a certain sense, relativistic 
wave functions and fields in complex spacetime may be regarded as 
generalized wavelet transforms of their counterparts in real spacetime. 
This is related to the fact, mentioned earlier, that relativistic windows 
shrink in the direction of motion, due to the Lorentz contraction as- 
sociated with the hyperbolic geometry of spacetime. This contraction 
is like the compression associated with wavelets. 

Other generalizations of the wavelet transform to more than one 
dimension have been studied (see Malat [1987]), but they are usually 
obtained from the one-dimensional one by taking tensor products, 
hence are not natural with respect to symmetries of IR n (such as ro- 
tations or Lorentz transformations), and consequently do not lends 
themselves to analysis by group-theoretic methods. The generaliza- 
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tion proposed here, which we call a windowed X-Ray transform, does 
not assume any prefered directions, hence it respects all symmetries 
of IR n . For example, the transforms of functions over real spacetime 
1R S+1 will transform naturally under the Poincare group. If these 
functions form a representation space for P (whether irreducible, as 
in the case of a free particle, or reducible, as in the case of systems of 
interacting fields), then so do their transforms. 

Let us start directly with the windowed X-Ray transform (Kaiser 
[1990b]). Fix a window function h:1R — > (E, which will play a role 
similar to a "basic wavelet." For a given (sufficiently well-behaved) 
function / on IR n , define f h on IT x IR n by 

/CO 
dthjtjf(x + ty). (1) 
-co 

Note that for n = 1 and y ^ 0, 



h(x,y) = \y\- 1 J dt'h^—^jf(t') 
= M" 1/2 (Wf)(x,y), 



(2) 



where Wf is the usual Wavelet transform based on the affine group 
A = IR©IR*, defined in section 1.6. On the other hand, for arbitrary 
n but 

fh coincides with the Analytic-Signal transform defined in section 
5.2. The name "windowed X-Ray transform" derives from the fact 
that for the choice h = 1, fh(x,y) is simply the integral of / along 
the line x(t) = x + ty, and if \y\ = 1 this is known as the X-Ray 
transform (Helgason [1984]), due to its applications in tomography 
(Herman [1979]). 

Returning to n dimensions and an arbitrary window function h(t), 
we wish to know, first of all, whether and how / can be reconstructed 
from fh. Note that the transform is trivial for y = 0, since 



f h (x,0) = f(x) j dth(t), (4) 

so we rule out y = and let y range over IR" = IR n \{0}. Note also 
that fh has the following dilation property for a ^ 0: 
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f h {x,ay) = j ' dt\a\ 1 h(t/a) f(x + ty) = f ha (x,y), (5) 

where h a (t) = |a| _1 h(t/a). 

To reconstruct /, begin by formally substituting the Fourier rep- 
resentation of / into fh'- 

f h (x,y) = Jdt J d n pe- 2 ^ +t ^W)f(p) 

= J d n pe- 27ripx h(py)f(p) ( 6 ) 

= ( h X) y \f) L 2 = ( h Xj y | / ) L 2, 

where h XjV is defined by 

h x , y (p)=e 2 ^h(py), (7) 

so that 

h x , y {x') = j Oe" 2 ^'" 1 ' Hpy). (8) 

Note that h Xty , and hence also h x>y , is not square-integrable for n > 1, 
since its modulus is constant along directions orthogonal to y* This 
means that eq. (6) will not make sense for arbitrary / G L 2 (IR n ). We 
therefore assume, initially, that / is a test function, say it belongs to 
the Schwartz space iS(IR n ). Then eq. (6) makes sense with ( | ) as 
the (sesquilinear) pairing between distributions and test functions. If 
h is also sufficiently well-behaved, then we can substitute 

h(py) = j duh(u)6(py-u) (9) 
and change the order of integration in eq. (8), obtaining 

h x , y {x') = J duh(u) J d n pe- 2 ^ x '- x h(py-u). (10) 

* So far, we have not assumed any metric structure on IR S+1 . Recall 
(sec. 1.1) that the natural domain of f(p) is the dual space (IR n )*. 
"Orthogonal" here means that p(y) = as a linear functional. This 
will be important when lR n is spacetime with its Minkowskian metric. 
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To reconstruct /, we look for a resolution of unity in terms of 
the vectors h XtV (chapter 1). That is, we need a measure dp,(x,y) on 
IET x IR™ such that 

J dp(x,y)\f h (x,y)\ 2 = \\f\\l 2 . (11) 

For then the map T: f i— > fh is an isometry onto its range in L 2 (dp), 
and polarization gives 

(g\f) = (Tg\Tf) = (g\T*Tf), (12) 

showing that / = T*(Tf) in L 2 (IR n ), which is the desired reconstruc- 
tion formula. There are various ways to obtain a resolution of unity, 
since / is actually overdetermined by fh, i.e. giving the values of fh 
on all of IR n x IR™ amounts to "oversampling," so fh will have to 
satisfy a consistency condition. We have seen several examples of this 
in the study of the windowed Fourier transform (section 1.5) and the 
one-dimensional wavelet transform (section 1.6 and chapter 2), where 
discrete subframes were obtained starting with a continuous resolu- 
tion of unity. However, for n > 1, there are other options than discrete 
subframes, as we will see. In the spirit of the one-dimensional wavelet 
transform, our first resolution of unity will involve an integration over 
all of IET x EC Note that 

f h (x,y)= (K(py)/)», (13) 
so Plancherel's theorem gives 

J d n x\f h (x,y)\ 2 = J dy\h(py)\ 2 \f(p)\ 2 . (14) 

We therefore need a measure dp(y) on IR™ such that 

H(p) = J dp{y) \h{py)\ 2 = 1 for almost all p. (15) 

The solution is simple: Every p ^ can be transformed to q = 
(1, 0, ... 0) by a dilation and rotation of IR n . That is, the orbit of q 
(in Fourier space) under dilations and rotations is all of IR^. Thus we 
choose dp to be invariant under rotations and dilations, which gives 

dp(y) = N\y\- n d n y, (16) 
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where iV is a normalization constant and \y\ is the Euclidean norm of 
y. Then for p^O, 



H(p) = H(q)=N J \y\- n d n y\h( yi )\ 
= N J d yi \h{ yi )\ 2 J 



2 



dy 2 --- dy n ' ' ' 



(v? + ■ ■ ■ + y 2 n ) n/2 ' 

But a straightforward computation gives 

dyi • • • dy n W 2 



/ 



(y 2 + --- + y 2 ) n/2 |yi|r(n/2)' 



(18) 



This shows that the measure dfi(x,y) = d n xdp(y) gives a resolution 
of unity if and only if 

Oh = J |lMO! 2 <oo, (19) 

which is precisely the admissibility condition for the usual (one- 
dimensional) Wavelet transform (section 1.6). If h is admissible, the 
normalization constant is given by 



T(n/2) 

7T n / 2 Ch 

and the reconstruction formula is then 



N = (20) 

— 71/ 2 - 



-^h^J?jfh{x,y). (21) 

i y i 

The sense in which this formula holds depends, of course on the be- 
havior of /. The class of possible /'s, in turn, depends on the choice 
of h. A rigorous analysis of these questions is not easy, and will not 
be attempted here. Note that in spite of the factor \y\ n in the denom- 
inator, there is no problem at y = since 

f h (x,0) = Mti)f(x) = (22) 

by the admissibility condition. The behavior of fh for small y can 
be analized using the dilation property, since eq. (5) implies that for 
A > 0. 
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f h (x,y/\) = J dt\h(Xt)f(x + ty). (23) 

Thus if h(t) decays rapidly, say if 

Xh(Xt) ^ as A -> co, (24) 

then we expect fh(x, y/X) — > as A — > oo. 

Since eq. (11) holds for admissible /i, we can now allow / e 
L 2 (IR n ). We would like to characterize the range of the map 
T: f h from L 2 (R n ) to L 2 (cfy/). The relation 

fh(x,y) = (h x , y \f) L 2 (25) 

shows that /i X;J , acts like an evaluation map taking fh e L 2 (d/i) to 
its "value" at (x,y). These linear maps on 3?t are, however, not 
bounded if n > 1, since then h XjV is not square-integrable. (In general, 
the "value" of fh at a point may be undefined.) Hence is not a 
reproducing-kernel Hilbert space (chapter 1). But in any case, the 
distributional kernel 



K(x,y; x\y') = (h XtV \ h x > tV , ) 

Jj^ e -^)kWhW) (26) 

represents the orthogonal projection from L 2 (d/j) onto 3?t- Thus a 
given function in L 2 (d[i) belongs to 3?t if and only if it satisfies the 
consistency condition 

9(x, V) = J dn(x', y') K(x, y; x' , y') g(x', y'), (27) 

where the integral is the symbolic representation of the action of K 
as a distribution. 



Remarks. 

1. For n = 1, the reconstruction formula is identical with the one for 
the continuous one-dimensional wavelet transform W f ', since by 
Eq. (2), 

[ d -^\h(*,y)\^ [ d ^\(wf)( x ,y)f. (28) 
j I y I j y 
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2. In deriving the resolution of unity and the related reconstruction 
formula, we have tacitly identified IR n as a Euclidean space, i.e. 
we have equipped it with the Euclidean metric and identified the 
pairing px in the Fourier transform as the inner product. The ex- 
act place where this assumption entered was in using the rotation 
group plus dilations to obtain IR™ from the single vector q, since 
rotations presume a metric. 

Having established fh as a generalization of the one-dimensional wa- 
velet transform, let us now investigate it in its own right. First, note 
that for n = 1 there were only two simple types of candidates for 
generalized frames of wavelets: (a) all continuous translations and 
dilations of the basic wavelet, or (b) a discrete subset thereof. For 
n > 1, any choice of a discrete subset of vectors h x>y spoils the in- 
variance under continuous symmetries such as rotations, and it is 
therefore not obvious how to use the above group-theoretic method 
to find discrete sub frames. In fact, the discrete subsets {(a m ,na m 6)} 
which gave frames of wavelets in section 1.6 and chapter 2 do not form 
subgroups of the affine group. One of the advantages of using tensor 
products of one-dimensional wavelets is that they do generate dis- 
crete frames for n > 1, though sacrificing symmetry. However, other 
options exist for choosing generalized (continuous) subframes when 
n > 1, and one may adapt one's choice to the problem at hand. Such 
choices fall between the two extremes of using all the vectors h XjV and 
merely summing over a discrete subset, as seen in the examples below. 

1. The X-Ray Transform 
The usual X-Ray transform is obtained by choosing h(t) = 1, which 
is not admissible in the above sense; hence the above "wavelet" re- 
construction fails. The reason is easy to see: Note that now fh has 
the following symmetries: 

fh(x,ay) = \a\~ 1 f h (x,y) Va G IR* 
f h (x + sy,y) = h(x,y) Vs G IR. 

Together, these equations state that fh depends only on the line of 
integration and not on the way it is parametrized. The first equation 
shows that integration over all y ^ is unnecessary as well as unde- 
sirable, and it suffices to integrate over the unit sphere \y\ = 1. The 
second equation shows that for a given y, it is (again) unnecessary and 
undesirable to integrate over all x, and it suffices to integrate over the 
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hyperplane orthogonal to y. The set of all such (x,y) does, in fact, 
correspond to the set of all lines in IR n , and the corresponding set of 
h x ,yS forms a continuous frame which gives the usual reconstruction 
formula for the X-Ray transform (Helgason [1984]). The moral of the 
story is that sometimes, inadmissibility in the "wavelet" sense carries 
a message: Reduce the size of the frame. 

2. The Radon Transform 
Next, choose v G IR and 



h„(t) = e~ 2nlvt . (30) 

Like the previous function, this one is inadmissible, hence the 
"wavelet" reconstruction fails. Again, this can be corrected by un- 
derstanding the reason for inadmissibility. Eq. (6) now gives 

//i„(*,v) = / d n pe- 2 ^8(py-is)f(p). (31) 
For any a ^ 0, we have 

fh v {?,ay) = \a\~ 1 f h ^(x,y), (32) 

where uj = vja. Hence it suffices to restrict the y-integration to the 
unit sphere, provided we also integrate over v e JR. Also, for any 
t G IR, 

f hv (x + ry,y) = J d n p e~ 2 ^ e~ 2 ™™ S(py - u) f{p) 
= e- 2 —f h „(x,y). 



Fixing x = 0, the function 

(Rf)(y,u) = f hv (0,y) (34) 

is called the Radon transform of / (Helgason [1984]). It may be 
regarded as being defined on the set of all hyperplanes in the Fourier 
space (IR n )*, and / can be reconstructed by integrating over the set 
of these hyperplanes. 

3. The Fourier-Laplace Transform 

Now consider 
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h(t) = i^r- ^ (35) 

which gives rise to the Analytic-Signal transform. (We have adopted 
a slightly different sign convention than is sec. 5.2. Also, note that 
we have re-inserted a factor of 2tt in the exponent in the Fourier 

transform, which simplifies the notation.) Then h is the exponential 
step function (sec. 5.2) 

h(t) = 0(t)e- 2 "* = 6- 2ir t, (36) 

and eq. (6) reads 

f h (x,y)= [ d n p9(py)e- 2 ^ x -^f(p) 

, , ~ (37) 

)e 2^p(x-zy) 



[ d"p< 
J M„ 



where M y is the half-space {p \py > 0}. This is the Fourier-Laplace 

transform of / in M y . For n = 1 and y > 0, it reduces to the usual 
Fourier-Laplace transform. 

This h, too, is not admissible. f(x) can be recovered simply by 
letting y — > 0, and fh(x,y) may be regarded as a regularization of 

f(x). If the support of / is contained in some closed convex cone 
T* C (IR n )*, then fh(x,y) = f(x — iy) is holomorphic in the tube 7r 
over the cone V dual to T*, i.e. 

r = { y g it \py > o Vpern 

_ (38) 



T r = {x - iy G (E n | y G T}. 



(Note that no metric has been assumed.) In that case, f(x) is a 
boundary value of f(x — iy). This forms the background for the theory 
of Hardy spaces (Stein and Weiss [1971]). We have encountered a 
similar situation when IR n was spacetime (n = s + 1), V* = V+, and 
f(x) was a positive-energy solution of the Klein-Gordon equation; 
then V = V and Tr = T + . But in that case, f(x) was not in L 2 (IR S+1 ) 
due to the conservation of probability. There it was unnecessary and 
undesirable to integrate |/(-z)| 2 over all of T + since it was determined 
by its values on any phase space o + C T + , and reconstruction was 
then achieved by integrating over cr + (chapter 4). 
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As seen from these examples, the windowed X-Ray transform has 
the remarkable feature of being related to most of the "classical" inte- 
gral transforms: The X-Ray, Radon and Fourier-Laplace transforms. 
Since the Analytic-Signal transform is a close relative of the multi- 
variate Hilbert transform H y (sec. 5.2), we may also add H y to this 
collection. 



References 



237 



REFERENCES 

Abraham, R. and Marsden, J. E. 

[1978] Foundations of Mechanics, Benjamin/ Cummings, Reading, 
MA. 

Abramowitz, M. and Stegun, L. A., eds. 

[1964] Handbook of Mathematical Functions, National Bureau of 
Standards, Applied Math. Series 55. 
Ali, T. and Antoine, J.-P. 

[1989] Ann. Inst. H. Poincare 51, 23. 
Applequist, T. Chodos, A. and Freund, P. G. O., eds. 

[1987] Modern Kaluza-Klein Theories, Addison-Wesley, Reading, 
MA. 

Aronszajn, N. 

[1950] Trans. Amer. Math. Soc. 68, 337. 
Aslaksen, E. W. and Klauder, J. R. 

[1968] J. Math. Phys. 9, 206. 

[1969] J. Math. Phys. 10, 2267. 
Bargmann, V. 

[1954] Ann. Math. 59, 1. 

[1961] Commun. Pure Appl. Math. 14, 187. 
Bargmann, V., Butera, P., Girardello, L. and Klauder, J. R. 

[1971] Reps. Math. Phys. 2, 221. 
Barut, A.O. and Malin, S. 

[1968] Revs. Mod. Phys. 40, 632. 
Battle, G. 

[1987] Commun. Math. Phys. 110, 601. 
Berezin, F. A. 

[1966] The Method of Second Quantization, Academic Press, New 
York. 
Bergman, S. 

[1950] The Kernel Function and Conformal Mapping, Amer. 
Math. Soc, Providence. 
Bialinicki-Birula, I. and Mycielski, J. 

[1975] Commun. Math. Phys. 44, 129. 
Bleeker, D. 

[1981] Gauge Theory and Variational Principles, Addison- Wesley, 
Reading, MA. 
Bohr, N. and Rosenfeld, L. 

[1950] Phys. Rev. 78, 794. 
Bott, R. 

[1957] Ann. Math. 66, 203. 



238 



References 



Carey, A. L. 

[1976] Bull. Austr. Math. Soc. 15, 1. 
Daubechies, I. 

[1988a] "The wavelet transform, time-frequency localization and sig- 
nal analysis," to be published in IEEE Trans. Inf. Thy. 
[1988b] Commun. Pure Appl. Math. XLI, 909. 
De Bievre, S. 

[1989] J. Math. Phys. 30, 1401. 
Dell' Antonio, G. F. 

[1961] J. Math. Phys. 2, 759. 
Dirac, P. A. M. 

[1943] Quantum Electrodynamics, Commun. of the Dublin 

Inst, for Advanced Studies series A, #1. 
[1946] Developments in Quantum Electrodynamics, Commun. of 
the Dublin Inst, for Advanced Studies series A, #3. 
Dufflo, M. and Moore, C. C. 

[1976] J. Funct. Anal. 21, 209. 
Dyson, F. J. 

[1956] Phys. Rev. 102, 1217. 
Fock, V. A. 

[1928] Z. Phys. 49, 339. 
Gel'fand, I. M., Graev, M. I. and Vilenkin, N. Ya. 

[1966] Generalized Functions, vol 5, Academic Press, New York. 
Gel'fand, I. M. and Vilenkin, N. Ya. 

[1964] Generalized Functions, vol 4, Academic Press, New York. 
Gerlach, B., Gomes, D. and Petzold, J. 

[1967] Z. Physik 202, 401. 
Glauber, R. J. 
[1963a] Phys. Rev. 130, 2529. 
[1963b] Phys. Rev. 131, 2766. 
Glimm, J. and Jaffe, A. 

[1981] Quantum Physics, Springer, New York. 
Green, M. W., Schwarz, J. H. and Witten, E. 

[1987] Superstring Theory, vols. I, II, Cambridge Univ. Press, 
Cambridge. 
Greenberg, O. W. 

[1962] J. Math. Phys. 3, 859. 
Guillemin, V. and Sternberg, S. 

[1984] Symplectic Techniques in Physics, Cambridge Univ. 
Press, Cambridge. 
Hai, N. X. 

[1969] Commun. Math. Phys. 12, 331. 



References 



239 



Halmos, P. 

[1967] A Hilbert Space Problem Book, Van Nostrand, London. 
Hegerfeldt, G. C. 

[1985] Phys. Rev. Lett. 54, 2395. 
Helgason, S. 

[1962] Differential Geometry and Symmetric Spaces, 

Academic Press, NY. 
[1978] Differential Geometry, Lie Groups, and Symmetric Spaces, 

Academic Press, New York. 
[1984] Groups and Geometric Analysis, Academic Press, New York. 
Henley, E. M. and Thirring, W. 

[1962] Elementary Quantum Field Theory, 
McGraw-Hill, New York. 
Herman, G. T., ed. 

[1979] Image Reconstruction from Projections: Implementation 

and Applications, Topics in Applied Physics 32, Springer 
New York. 
Hermann, R. 

[1966] Lie Groups for Physicists, Benjamin, New York. 
Hille, E. 

[1972] Rocky Mountain J. Math. 2, 319. 
Inonii, E. and Wigner, E. P. 

[1953] Proc. Nat. Acad. Sci., USA, 39, 510. 
Itzykson, C. and Zuber, J.-B. 

[1980] Quantum Field Theory, McGraw-Hill, New York. 
Jost, R. 

[1965] The General Theory of Quantized Fields, Amer. Math. Soc, 
Providence. 
Kaiser, G. 

[1974] "Coherent States and Asymptotic Representations," in Proc. 

of Third Internat. Colloq. on Group-Theor. Meth. in 

Physics, Marseille. 
[1977a] " Relativistic Coherent-State Representations," in 

Group-Theoretical Methods in Physics, R. Sharp and B. 

Coleman, eds., Academic Press, New York. 
[1977b] J. Math. Phys. 18, 952. 

[1977c] Phase-Space Approach to Relativistic Quantum Mechanics, 
Ph. D. thesis, Mathematics Dept, Univ. of Toronto. 

[1977d] "Phase-Space Approach to Relativistic Quantum 

Mechanics," in proc. of Eighth Internat. Conf. on General 
Relativity, Waterloo, Ont. 

[1978a] J. Math. Phys. 19, 502. 



240 References 

[1978b] "Hardy spaces associated with the Poincare group," in Lie 
Theories and Their Applications, W. Rossmann, ed., 
Queens Univ. Press, Ont. 

[1978c] "Local Fourier Analysis and Synthesis," Univ. of Lowell 

preprint (unpublished). 
[1979] Lett. Math. Phys. 3, 61. 

[1980a] "Holomorphic gauge theory," in Geometric Methods in 

Mathematical Physics, G. Kaiser and J. E. Marsden, eds., 
Lecture Notes in Math. 775, Springer, Heidelberg. 

[1980b] "Relativistic quantum theory in complex spacetime," in Dif- 
ferential Geometrical Methods in Mathematical Physics, 
P.L. Garcia, A. Perez-Rendon and J.-M. Souriau, eds., 
Lecture Notes in Math. 836, Springer, Berlin. 
[1981] J. Math. Phys. 22, 705. 

[1984a] "A sampling theorem for signals in the joint time- frequency 
domain," Univ. of Lowell preprint (unpublished). 

[1984b] "On the relation between the formalisms of Prugovecki and 
Kaiser," Univ. of Lowell preprint, presented as a poster at 
the VH-th International Congress of Mathematical Physics, 
in Mathematical Physics VII, W. E. Brittin, K. E. Gustafson 
and W. Wyss, eds., North-Holland, Amsterdam 

[1987a] Ann. Phys. 173, 338. 

[1987b] "A phase-space formalism for quantized fields," in 

Group-Theoretical Methods in Physics, R. Gilmore, ed., 

World Scientific, Singapore. 
[1990a] "Algebraic Theory of Wavelets. I: Operational Calculus and 

Complex Structure," Technical Repors Series #17, Univ. of 

Lowell. 

[1990b] "Generalized Wavelet Transforms. I: The Windowed X-Ray 
Transform," Technical Repors Series #18, Univ. of Lowell. 
[1990c] "Generalized Wavelet Transforms. II: The Multivariate 

Analytic-Signal Transform," Technical Repors Series #19, 
Univ. of Lowell. 
Kirillov, A. A. 

[1976] Elements of the Theory of Representatins, Springer- 
Verlag, Heidelberg. 
Klauder, J. R. 

[1960] Ann. Phys. 11, 123. 
[1963a] J Math. Phys. 4, 1055. 
[1963b] J Math. Phys. 4, 1058. 
Klauder, J. R. and Skagerstam, B.-S., eds. 

[1985] Coherent States, World Scientific, Singapore. 



References 



241 



Klauder, J. R. and Sudarshan, E. C. G. 

[1968] Fundamentals of Quantum Optics, Benjamin, New York. 
Kobayashi, S. and Nomizu, K. 

[1963] Foundations of Differential Geometry, vol. I, Interscience, 
New York. 

[1969] Foundations of Differential Geometry, vol. II, 
Interscience, New York. 
Kostant, B. 

[1970] "Quantization and Unitary Representatins," in Lectures in 
Modern Analysis and Applications, R. M. Dudley et al., eds. 
Lecture Notes in Math. 170, Springer, Heidelberg. 
Lemarie, P. G. and Meyer, Y. 

[1986] Rev. Math. Iberoamericana 2, 1. 
Lurgat, F. 

[1964] Physics 1, 95. 
Mackey, G. 

[1955] The Theory of Group-Representations, Lecture Notes, Univ. 
of Chicago. 

[1968] Induced Representations of Groups and Quantum Mechan- 
ics, Benjamin, New York. 
Mallat, S. 

[1987] "Multiresolution approximation and wavelets," 

preprint, GRASP Lab., Dept. of Computer and Information 
Science, Univ. of Pennsylvania, to be published in Trans. 
Amer. Math. Soc. 
Marsden, J. E. 

[1981] Lectures on Geometric Methods in Mathematical Physics, 
Notes from a Conference held at Univ. of Lowell, CBMS- 
NSF Regional Conference Series in Applied Mathematics vol. 
37, SIAM, Philadelphia, PA. 
Meschkowski, H. 

[1962] Hilbertsche Raume mit Kernfunktion, Springer, Heidelberg. 
Messiah, A. 

[1963] Quantum Mechanics, vol. II, North-Holland, Amsterdam. 
Meyer, Y. 

[1985] "Principe d'incertitude, bases hilbertiennes et algebres 

d'operateurs," Seminaire Bourbaki 662. 
[1986] "Ondelettes et functions splines," Seminaire EDP, 

Ecole Poly technique, Paris, France (December issue). 
Misner, C, Thorne, K. S. and Wheeler, J. A. 

[1970] Gravitation, W. H. Freeman, San Francisco. 



242 



References 



Nelson, E. 

[1959] Ann. Math. 70, 572. 
[1973a] J. Funct. Anal. 12, 97. 
[1973b] J. Funct. Anal. 12, 211. 
Neumann, J. von 

[1931] Math. Ann. 104, 570. 
Newton, T. D. and Wigner, E. 

[1949] Rev. Mod. Phys. 21, 400. 
Onofri, E. 

[1975] J. Math. Phys. 16, 1087. 
Papoulis, A. 

[1962] The Fourier Integral and its Applications, McGraw-Hill. 
Perelomov, A. M. 

1972] Commun. Math. Phys. 26, 222. 
1986] Generalized Coherent States and Their Applications, 
Springer, Heidelberg. 
Prugovecki, E. 

1976] J. Phys. A9, 1851. 
1978] J. Math. Phys. 19, 2260. 

1984] Stochastic Quantum Mechanics and Quantum Spacetime, 
D. Reidel, Dordrecht. 
Robinson, D. W. 

[1962] Helv. Physica Acta 35, 403. 
Roederer, J. G. 

[1975] Introduction to the Physics and Psychophysics of Music, 
Springer, New York. 
Schrodinger, E. 

[1926] Naturwissenschaften 14, 664. 
Segal, I. E. 
[1956a] Trans. Amer. Math. Soc. 81, 106. 
[1956b] Ann. Math. 63, 160. 

[1958] Trans. Amer. Math. Soc. 88, 12. 
[1963a] Illinois J. Math. 6, 500. 

[1963b] Mathematical Problems of Relativistic Physics, Amer. 
Math. Soc, Providence. 
[1965] Bull. Amer. Math. Soc. 71, 419. 
Simms, D. J. and Woodhouse, N. M. J. 

[1976] Lectures on Geometric Quantization, Lecture Notes in Phys- 
ics 53, Springer Verlag, Heidelberg. 



References 



243 



Sniatycki, J. 

[1980] Geometric Quantization and Quantum mechanics, 
Springer, Heidelberg. 
Souriau, J.-M. 

[1970] Structure des Systemes Dynamiques, Dunod, Paris. 
Stein, E. 

[1970] Singular Integrals and Differentiability Properties of 
Functions, Princeton Univ. Press, Princeton. 
Strang, G. 

[1989] SIAM Rev. 31, 614. 
Streater, R. F. 

[1967] Commun. Math. Phys. 4, 217. 
Streater, R. F. and Wightman, A. S. 

[1964] PCT, Spin k Statistics, and All That, Benjamin, New York. 
Thirring, W. 

[1980] Quantum Mechanics of Large Systems, Springer, New York. 
Toller, M. 

[1978] Nuovo Cim. 44B, 67. 
Unterberger, A. 

[1988] Commun. in Partial Diff. Eqs. 13, 847. 
Varadarajan, V. S. 

[1968] Geometry of Quantum Theory, vol. I, D. Van Nostrand, 
Princeton, NJ. 

[1970] Geometry of Quantum Theory, vol. II, D. Van Nostrand, 
Princeton, NJ. 

[1974] Lie Groups, Lie Algebras, and their Representations, 
Prentice-Hall, Englewood Cliffs, NJ. 
Warner, F. 

[1971] Foundations of Differentiate Manifolds and Lie Groups, 
Scott, Foresman and Co., London. 
Wells, R. O. 

[1980] Differential Analysis on Complex Manifolds, Springer, New 
York. 

Weyl, H. 

1919] Ann. d. Physik 59, 101. 

1931] The Theory of Groups and Quantum Mechanics, translated 
by H. P. Robertson, Dover, New York 
Wightman, A. S. 

[1962] Rev. Mod. Phys. 34, 845. 
Yosida, G. 

[1971] Functional Analysis, Springer, Heidelberg. 



244 



References 



Young, R. M. 

[1980] An Introduction to Non-Harmonic Fourier Series, Academic 
Press, New York. 
Zakai, M. 

[1960] Information and Control 3, 101. 



